CANADIAN
JOURNAL OF MATHEMATICS


Journal Canadien de Mathématiques
VOL. XII - NO. 1


1960

Sur quelques séries de Lambert et de
Dirichlet Jacques Touchard


Power series representing certain rational 

functions Z. A. Melzak 
Certain bilateral hypergeometric identities of 

Cayley and Orr type Nirmala Agarwal 


On connections between growth and distribu- 
tion of zeros of integral functions Q. I. Rahman 


A cosine functional equation in Hilbert space Svetozar Kurepa 


Sheets of real analytic varieties Andrew H. Wallace 
On finite nilpotent groups G. Bachman 


Finite groups which admit an automorphism with 
few orbits Daniel Gorenstein 


Sur le radical corpoidal d’un anneau G. Thierrin 
On representations of orders over Dedekind domains D.G. Higman 


Integral p-adic normal matrices satisfying the 
incidence equation J. K. Goldhaber 


Loops with adjoints W. R. Cowell 
On quadruple systems Haim Hanani 


A metrization for power-sets with applications 
to combinatorial analysis Robert Silverman 


Published for 
THE CANADIAN MATHEMATICAL CONGRESS 
by the 


University of Toronto Press 





EDITORIAL BOARD 


H. S. M. Coxeter, G. F. D. Duff, R. D. James, R. L. Jeffery, 
J.-M. Maranda, G. de B. Robinson, P. Scherk


with the co-operation of 


D. B. DeLury, J. Dixmier, W. Fenchel, H. Freudenthal, I. Kaplansky,
N. S. Mendelsohn, C. A. Rogers, H. Schwerdtfeger, A. W. Tucker,
W. J. Webber, M. Wyman 


The chief languages of the Journal are English and French. 


Manuscripts for publication in the Journal should be sent to the 
Editor-in-Chief, G. F. D. Duff, University of Toronto. Authors are 
asked to write with a sense of perspective and as clearly as possible, 
especially in the introduction. Regarding typographical conventions, 
attention is drawn to the Author’s Manual of which a copy will be 
furnished on request. 


All other correspondence should be addressed to the Managing 
Editor, G. de B. Robinson, University of Toronto. 


The Journal is published quarterly. Subscriptions should be sent 
to the Managing Editor. The price per volume of four numbers 
is $10.00. This is reduced to $5.00 for individual members of recognized 
Mathematical Societies. 


The Canadian Mathematical Congress gratefully acknowledges the 
assistance of the following towards the cost of publishing this Journal: 


University of Alberta Assumption University 
University of British Columbia Carleton College 
Dalhousie University Ecole Polytechnique 
Université Laval Loyola College 
University of Manitoba McGill University 
McMaster University Université de Montréal 
Mount Allison University Nova Scotia Technical College 
Queen’s University St. Mary’s University 
University of Saskatchewan University of Toronto 

National Research Council of Canada 

and the 
American Mathematical Society 


AUTHORIZED AS SECOND CLASS MAIL, POST OFFICE DEPARTMENT, OTTAWA 
















SUR QUELQUES SERIES DE LAMBERT ET 
DE DIRICHLET 


JACQUES TOUCHARD 


Introduction. In this work we study series of which the simplest type is
= 1 — 
We also study their analogues obtained on replacing $x^n$ by $n^{-s}$ and $y^n$ by $n^{-s'}$. For want of another name, we have ventured to call these series Lambert series and Dirichlet series respectively. A more general series than the preceding one is

$$\sum_{k=1}^{\infty} a_k\,\frac{g(x^k)}{1-x^{2\omega k}}\,\frac{g(y^k)}{1-y^{2\omega k}},$$

where the $a_k$ are constants and $g(x)$ is a polynomial whose coefficients are Jacobi symbols (1, pp. 132-40; 4, pp. 361-9) having the period $2\omega$.

The expressions

$$g\!\left(\exp\frac{\pi i m}{\omega}\right)$$

have multiplicative properties analogous to those of Ramanujan's sums (6), from which two propositions concerning the roots of $g(x)$ can be deduced. We shall confine ourselves to stating them, in §§ 6 and 7, since they follow from calculations carried out long ago by Dirichlet (2, pp. 46-50, 188-93) in his researches on the class number of binary quadratic forms of given determinant. The polynomials $g(x)$ are therefore far from new. We have nevertheless used them, in §§ 10 and 11, to obtain series analogous to a beautiful series of Ramanujan. In §§ 13 and 14 we give representations by definite integrals of various functions examined earlier.


Received August 10, 1958.

1. Lambert series. Let $\chi(n)$ be a completely multiplicative arithmetic function, that is, one such that $\chi(m)\,\chi(n) = \chi(mn)$ for all positive integers $m$ and $n$. Denote by $A(x)$ the series, assumed convergent,

(1) $$A(x) = \sum_{n=1}^{\infty} \chi(n)\,x^{n}$$

and let

(2) $$G(x,y) = \sum_{k=1}^{\infty} a_k\,A(x^k)\,A(y^k),$$

where the $a_k$ are constant coefficients. What is the term in $x^{\lambda}y^{\mu}$ on the right-hand side of (2)? Since

(3) $$a_k\,A(x^k)\,A(y^k) = a_k \sum_{h=1}^{\infty}\sum_{h'=1}^{\infty} \chi(h)\,\chi(h')\,x^{hk}\,y^{h'k},$$

we have $hk = \lambda$, $h'k = \mu$ only if $k$ is a common divisor of $\lambda$ and $\mu$ and, consequently, a divisor $k = d$ of their g.c.d. $\Delta$. If this is so, the term in $x^{\lambda}y^{\mu}$ coming from (3) is

$$a_d\,\chi\!\left(\frac{\lambda}{d}\right)\chi\!\left(\frac{\mu}{d}\right)x^{\lambda}y^{\mu}.$$

Put $\lambda = \Delta\lambda'$, $\mu = \Delta\mu'$, so that $(\lambda', \mu') = 1$; we see that the term in

$$x^{\lambda}y^{\mu} = (x^{\Delta})^{\lambda'}(y^{\Delta})^{\mu'}$$

on the right-hand side of (2) is

$$\sum_{d \mid \Delta} a_d\,\chi\!\left(\frac{\lambda}{d}\right)\chi\!\left(\frac{\mu}{d}\right)(x^{\Delta})^{\lambda'}(y^{\Delta})^{\mu'}$$

or again, since $\chi(n)$ is completely multiplicative,

$$\left(\sum_{d \mid \Delta} a_d\,\chi^{2}\!\left(\frac{\Delta}{d}\right)\right)\chi(\lambda'\mu')\,(x^{\Delta})^{\lambda'}(y^{\Delta})^{\mu'}.$$

Now let

(4) $$\psi(x,y) = \sum_{(l,m)=1} \chi(lm)\,x^{l}y^{m},$$

where $l = 1, 2, 3, \ldots$, $m = 1, 2, 3, \ldots$ but $l$ and $m$ are relatively prime. The g.c.d. $\Delta$ can take all the integer values $1, 2, 3, \ldots$ and consequently, noting that $\chi(1) = 1$,

(5) $$G(x,y) = a_1\,\psi(x,y) + \cdots + \left(\sum_{d \mid n} a_d\,\chi^{2}\!\left(\frac{n}{d}\right)\right)\psi(x^{n}, y^{n}) + \cdots.$$

Consider next the series

(6) $$G(x,y,z) = \sum_{k=1}^{\infty} a_k\,A(x^k)\,A(y^k)\,A(z^k).$$

Reasoning as above, let us find the term in $x^{\lambda}y^{\mu}z^{\nu}$ on the right-hand side of (6). Denote by $\Delta$ the g.c.d. of $\lambda$, $\mu$, $\nu$ and put $\lambda = \Delta\lambda'$, $\mu = \Delta\mu'$, $\nu = \Delta\nu'$; then $\lambda'$, $\mu'$, $\nu'$ have no common divisor and the required term in

$$x^{\lambda}y^{\mu}z^{\nu} = (x^{\Delta})^{\lambda'}(y^{\Delta})^{\mu'}(z^{\Delta})^{\nu'}$$

is

$$\left(\sum_{d \mid \Delta} a_d\,\chi^{3}\!\left(\frac{\Delta}{d}\right)\right)\chi(\lambda'\mu'\nu')\,(x^{\Delta})^{\lambda'}(y^{\Delta})^{\mu'}(z^{\Delta})^{\nu'}.$$

If then

(7) $$\psi(x,y,z) = \sum_{(l,m,n)=1} \chi(lmn)\,x^{l}y^{m}z^{n},$$

where the integers $l$, $m$, $n$ each run from 1 to $\infty$ but have no common divisor, we have

(8) $$G(x,y,z) = a_1\,\psi(x,y,z) + \cdots + \left(\sum_{d \mid n} a_d\,\chi^{3}\!\left(\frac{n}{d}\right)\right)\psi(x^{n}, y^{n}, z^{n}) + \cdots.$$

It is clear that formulas analogous to the preceding ones hold whatever the number of variables appearing in the functions $G$ and $\psi$.
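The coefficient comparison behind formula (5) can be checked mechanically. The sketch below is only an illustration under stated assumptions: it takes for $\chi$ the non-principal character modulo 4 (one convenient completely multiplicative function) and an arbitrary integer sequence `a`; the names `coeff_direct` and `coeff_via_psi` are hypothetical helpers, not notation from the paper.

```python
from math import gcd

def chi(n):
    """Non-principal character mod 4 (assumed example of a completely multiplicative chi)."""
    return (0, 1, 0, -1)[n % 4]

def coeff_direct(a, lam, mu):
    """Coefficient of x^lam y^mu in G(x, y) = sum_k a[k] A(x^k) A(y^k), read off from (3)."""
    delta = gcd(lam, mu)
    return sum(a[d] * chi(lam // d) * chi(mu // d)
               for d in range(1, delta + 1) if delta % d == 0)

def coeff_via_psi(a, lam, mu):
    """The same coefficient as predicted by formula (5)."""
    delta = gcd(lam, mu)
    lam1, mu1 = lam // delta, mu // delta
    weight = sum(a[d] * chi(delta // d) ** 2
                 for d in range(1, delta + 1) if delta % d == 0)
    return weight * chi(lam1 * mu1)

N = 30
a = {k: 3 * k * k - 7 * k + 2 for k in range(1, N + 1)}  # arbitrary test coefficients
mismatches = [(lam, mu) for lam in range(1, N + 1) for mu in range(1, N + 1)
              if coeff_direct(a, lam, mu) != coeff_via_psi(a, lam, mu)]
print(mismatches)
```

Because everything is integer arithmetic, any disagreement between the two readings of the coefficient would show up as a non-empty list.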


2. Dirichlet series. Let us use, as many authors have done, the correspondence between

$$x^{n} \text{ and } n^{-s}, \qquad y^{n} \text{ and } n^{-s'}, \qquad z^{n} \text{ and } n^{-s''};$$

the analogue of $A(x)$ is

(9) $$X(s) = \sum_{n=1}^{\infty} \frac{\chi(n)}{n^{s}},$$

the analogue of $A(x^k)$ is $k^{-s}X(s)$, and the analogue of $G(x,y)$ is

(10) $$\sum_{k=1}^{\infty} a_k\,k^{-s-s'}\,X(s)\,X(s') = X(s)\,X(s') \sum_{k=1}^{\infty} \frac{a_k}{k^{s+s'}}.$$

Put, further,

(11) $$X_q(s) = \sum_{n=1}^{\infty} \frac{\chi^{q}(n)}{n^{s}}, \qquad q = 2, 3, \ldots$$

and let, on the other hand, $\xi(s,s')$ be the analogue of $\psi(x,y)$:

(12) $$\xi(s,s') = \sum_{(l,m)=1} \frac{\chi(lm)}{l^{s}\,m^{s'}};$$

the analogue of $\psi(x^{n}, y^{n})$ is then $n^{-s-s'}\,\xi(s,s')$ and the right-hand side of equation (5) becomes

$$\xi(s,s')\left[a_1 + \cdots + \frac{1}{n^{s+s'}} \sum_{d \mid n} a_d\,\chi^{2}\!\left(\frac{n}{d}\right) + \cdots\right].$$

The bracket is the product of

$$\sum_{n=1}^{\infty} \frac{\chi^{2}(n)}{n^{s+s'}} = X_2(s+s') \qquad \text{by} \qquad \sum_{n=1}^{\infty} \frac{a_n}{n^{s+s'}};$$

comparing with (10), we obtain the formula

(13) $$\xi(s,s') = \frac{X(s)\,X(s')}{X_2(s+s')}.$$

An entirely similar procedure, relying on formulas (7) and (8) and putting

(14) $$\xi(s,s',s'') = \sum_{(l,m,n)=1} \frac{\chi(lmn)}{l^{s}\,m^{s'}\,n^{s''}},$$

gives the relation

(15) $$\xi(s,s',s'') = \frac{X(s)\,X(s')\,X(s'')}{X_3(s+s'+s'')}.$$

Formulas (13) and (15) are easy to prove directly. Consider, for example, formula (14); multiply both sides by

$$\chi^{3}(k)\,k^{-s-s'-s''};$$

we have

$$\frac{\chi^{3}(k)}{k^{s+s'+s''}}\,\xi(s,s',s'') = \sum_{(l,m,n)=1} \frac{\chi(lk)\,\chi(mk)\,\chi(nk)}{(lk)^{s}\,(mk)^{s'}\,(nk)^{s''}} = \sum_{(l,m,n)=k} \frac{\chi(lmn)}{l^{s}\,m^{s'}\,n^{s''}},$$

where, in the last sum, $k$ is the g.c.d. of the three numbers $l$, $m$, and $n$. Since arbitrary triples of positive integers have as g.c.d. either 1, or 2, or 3, ..., we obtain, on putting $k = 1, 2, 3, \ldots$ in the preceding equation and summing,

$$X_3(s+s'+s'')\,\xi(s,s',s'') = \sum_{l=1}^{\infty} \frac{\chi(l)}{l^{s}} \sum_{m=1}^{\infty} \frac{\chi(m)}{m^{s'}} \sum_{n=1}^{\infty} \frac{\chi(n)}{n^{s''}} = X(s)\,X(s')\,X(s''),$$

which is formula (15). Several other proofs can be found, in particular by using the decomposition of $X(s)$ into factors. We shall now give two very simple examples.
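Formula (13) lends itself to a quick numerical sanity check. The following is a minimal sketch, assuming the non-principal character modulo 4 as $\chi$ and the arbitrary exponents $s = 3$, $s' = 4$; all sums are simply truncated, so agreement is only expected to a few decimal places. The helper names `X` and `xi` mirror the notation of this section but are otherwise hypothetical.

```python
from math import gcd

def chi(n):
    """Non-principal character mod 4 (assumed example)."""
    return (0, 1, 0, -1)[n % 4]

def X(s, q=1, terms=200_000):
    """Truncated X_q(s) = sum_n chi(n)^q / n^s."""
    return sum(chi(n) ** q / n ** s for n in range(1, terms + 1))

def xi(s, s1, bound=300):
    """Truncated xi(s, s') = sum over coprime l, m of chi(l m) / (l^s m^s')."""
    return sum(chi(l * m) / (l ** s * m ** s1)
               for l in range(1, bound + 1)
               for m in range(1, bound + 1) if gcd(l, m) == 1)

s, s1 = 3.0, 4.0
lhs = xi(s, s1)
rhs = X(s) * X(s1) / X(s + s1, q=2)
print(lhs, rhs)
```

The truncation at `bound = 300` leaves a tail of order $300^{-2}$, which is why the comparison should be made with a tolerance rather than exactly.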


3. Examples. As a first example, let $\chi(n)$ be a complex character (7, pp. 391-414) in the sense of Dirichlet, for the modulus 9. The number 9 has two primitive roots, 2 and 5, and $\varphi(9) = 6$. Choose the primitive root 2 as base of the indices and $\omega = j = \exp(2\pi i/3)$ as root of $x^{3} - 1 = 0$. Then

$$\chi(n) = \omega^{\operatorname{ind} n} \quad \text{and} \quad \chi(n) = 0 \ \text{ if } 3 \mid n,$$

$$\chi(1) = 1, \quad \chi(2) = j, \quad \chi(3) = 0, \quad \chi(4) = j^{2},$$
$$\chi(5) = j^{2}, \quad \chi(6) = 0, \quad \chi(7) = j, \quad \chi(8) = 1,$$
$$\chi(9) = 0, \qquad \chi(n+9) = \chi(n).$$

Keeping the notation of § 2, we have

$$\xi(s,s') = \frac{X(s)\,X(s')}{X_2(s+s')},$$

$$\xi(s,s',s'') = \frac{X(s)\,X(s')\,X(s'')}{X_3(s+s'+s'')} = \frac{X(s)\,X(s')\,X(s'')}{\left(1 - 3^{-s-s'-s''}\right)\zeta(s+s'+s'')},$$

$\zeta(\sigma)$ being the Riemann function. Finally, since $\chi^{4}(n) = \chi(n)$, whence $X_4(s) = X(s)$, if we put

$$\xi(s_1, s_2, s_3, s_4) = \sum_{(l_1, l_2, l_3, l_4)=1} \frac{\chi(l_1 l_2 l_3 l_4)}{l_1^{s_1}\,l_2^{s_2}\,l_3^{s_3}\,l_4^{s_4}},$$

we have

$$\xi(s_1, s_2, s_3, s_4) = \frac{X(s_1)\,X(s_2)\,X(s_3)\,X(s_4)}{X(s_1 + s_2 + s_3 + s_4)}.$$

If, instead of forming the characters (mod 9) with the root $\omega = +j$, we had chosen the root $\omega = -j$, it would have required 7 variables $s_1, s_2, \ldots, s_7$ to obtain

$$\xi(s_1, \ldots, s_7) = \frac{X(s_1)\,X(s_2) \cdots X(s_7)}{X(s_1 + s_2 + \cdots + s_7)}.$$
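The character table modulo 9 used in this example can be regenerated from the indices to the base 2. The sketch below is illustrative only; `INDEX` and the optional `omega` argument are hypothetical names. It also lets one confirm that $\chi^{4} = \chi$ for $\omega = j$, while for $\omega = -j$ the first power that returns $\chi$ is the seventh.

```python
import cmath

J = cmath.exp(2j * cmath.pi / 3)            # j = exp(2*pi*i/3)

# Indices to the base 2 modulo 9: residue 2**k mod 9 has index k.
INDEX = {pow(2, k, 9): k for k in range(6)}

def chi(n, omega=J):
    """chi(n) = omega**ind(n) for n prime to 3, and 0 when 3 divides n."""
    r = n % 9
    return 0 if r % 3 == 0 else omega ** INDEX[r]

table = [chi(n) for n in range(1, 10)]
fourth_power_is_chi = all(abs(chi(n) ** 4 - chi(n)) < 1e-12 for n in range(1, 10))
seventh_power_is_chi = all(abs(chi(n, -J) ** 7 - chi(n, -J)) < 1e-12 for n in range(1, 10))
multiplicative = all(abs(chi(m * n) - chi(m) * chi(n)) < 1e-12
                     for m in range(1, 19) for n in range(1, 19))
print(table)
```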


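4. As a second example of a completely multiplicative function, we choose the function $\lambda(n)$ defined as follows (6, p. 254):

$$n = p_1^{a_1}\,p_2^{a_2} \cdots, \qquad \lambda(n) = (-1)^{a_1 + a_2 + \cdots}.$$

It is known that

$$X(s) = \sum_{n=1}^{\infty} \frac{\lambda(n)}{n^{s}} = \frac{\zeta(2s)}{\zeta(s)}$$

and it is clear that

$$X_2(s) = \zeta(s).$$

We therefore have, by § 2, the formulas

$$\sum_{(l,m)=1} \frac{\lambda(lm)}{l^{s}\,m^{s'}} = \frac{\zeta(2s)\,\zeta(2s')}{\zeta(s)\,\zeta(s')\,\zeta(s+s')}$$

and

$$\sum_{(l,m,n)=1} \frac{\lambda(lmn)}{l^{s}\,m^{s'}\,n^{s''}} = \frac{\zeta(2s)\,\zeta(2s')\,\zeta(2s'')\,\zeta(s+s'+s'')}{\zeta(s)\,\zeta(s')\,\zeta(s'')\,\zeta(2s+2s'+2s'')}.$$

An analogous example would be obtained by taking, in place of $\lambda(n)$, the function $\lambda(n)\,(-1/n) = \lambda(n)\,\sin(n\pi/2)$, used by Landau (8).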
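The identity $\sum \lambda(n)\,n^{-s} = \zeta(2s)/\zeta(s)$ quoted above is easy to probe numerically. This is a small sketch under the assumption $s = 2$, where the right-hand side is $\zeta(4)/\zeta(2) = \pi^{2}/15$; the function name `liouville` is a hypothetical helper and the sum is truncated at an arbitrary bound.

```python
from math import pi

def liouville(n):
    """lambda(n) = (-1)**(number of prime factors of n, counted with multiplicity)."""
    count, p = 0, 2
    while p * p <= n:
        while n % p == 0:
            n //= p
            count += 1
        p += 1
    if n > 1:
        count += 1
    return -1 if count % 2 else 1

N = 20_000
partial = sum(liouville(n) / n ** 2 for n in range(1, N + 1))
exact = (pi ** 4 / 90) / (pi ** 2 / 6)      # zeta(4) / zeta(2) = pi**2 / 15
print(partial, exact)
```

The neglected tail is bounded by $\sum_{n > N} n^{-2} < 1/N$, so the two printed values should agree to about four decimal places.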





5. Jacobi symbol. We shall now restrict ourselves to those completely multiplicative functions which are real Dirichlet characters, that is, characters equal to $\pm 1$. It is known that such a character reduces to the Jacobi symbol $(D/n)$.

For simplicity, we shall suppose that the number $D$ is positive and that it has no square divisor. Consequently, if $P$ denotes a product of odd prime factors, all different, we shall have either $D = P$ or $D = 2P$. This being so, we think it useful to collect here some properties of the character $\chi(n) = (D/n)$.

$\chi(n)$ is zero if $n$ is not prime to $2D$. The smallest period $2\omega$ of $\chi(n)$ is:

if $D = P \equiv 1 \pmod 4$, $2\omega = 2P$,
if $D = P \equiv 3 \pmod 4$, $2\omega = 4P$,
if $D = 2P$, $2\omega = 8P$,

(16) $$\chi(n + 2\omega) = \chi(n).$$

Moreover

(17) $$\chi(2\omega - n) = \chi(n)$$
(18) > x(n) = 0, 
(19) > x(n) = 0. 
n=1 


Les équations (16) 4 (19) sont valables dans les trois cas. 
Quand la période 2w est divisible par 4 ou par 8, on a 


(20) x(n + w) = —x(n), 


mais il n’en est plus de méme quand 2w est le double d’un impair: 
Si 2w = 4P, ona 


(21) DX x4k+1)=0, LY x(4k+3) = 0, 
we Qe 

(22) > x(44k+1)=0, > x(4k+3) =0. 
1 3 


Enfin, si 2w = 8P, les équations (22) ont lieu, mais non pas les équations (21). 
Dans les formules ci-dessus, il faut comprendre que l’argument 4.k + 1, 
par exemple, prend toutes les valeurs de cette forme plus petites que la limite 







supérieure de la somme. II est clair d’ailleurs que x(m) est nul quand n n'est 
pas premier 4 2w. 


6. The polynomials $g(x)$. Keeping the notation of the preceding section, these polynomials, which we mentioned in the Introduction, are defined by

(23) $$g(x) = \sum_{(\mu, 2\omega)=1} \chi(\mu)\,x^{\mu}, \qquad \mu < 2\omega.$$

Let $\rho = \exp(\pi i/\omega)$; then it is evident that, if $a$ is prime to $2\omega$,

$$g(\rho^{a}) = \chi(a)\,g(\rho).$$

Moreover, and this is Proposition A: If the period $2\omega$ is divisible by 4 or by 8, $g(x)$ admits all the non-primitive roots of the equation $x^{2\omega} - 1 = 0$. For example, if

$$\chi(n) = \left(\frac{15}{n}\right), \qquad 2\omega = 60,$$

$$g(x) = x + x^{7} + x^{11} - x^{13} + x^{17} - x^{19} - x^{23} - x^{29} - x^{31} - x^{37} - x^{41} + x^{43} - x^{47} + x^{49} + x^{53} + x^{59}$$
$$= x\,(1 - x^{12})(1 + x^{6})(1 + x^{10})(1 - x^{30})$$

and the equation for the non-primitive roots of $x^{60} - 1 = 0$ is

$$\Phi(x) = \frac{(x^{30} - 1)(x^{10} + 1)(x^{6} + 1)}{x^{2} + 1} = 0.$$

Such a property can no longer hold when the period $2\omega$ is twice an odd number. Indeed, $a$ being prime to $2\omega$, we have

$$\rho^{a+\omega} = -\rho^{a}$$

and

$$g(\rho^{a+\omega}) = -\chi(a)\,g(\rho).$$

$a + \omega$ is an even number, prime to $\omega$, and $-\rho^{a}$ is a primitive root of $x^{\omega} - 1 = 0$. We then have Proposition B:

If $2\omega$ is twice an odd number, $g(x)$ admits all the non-primitive roots of $x^{2\omega} - 1 = 0$, except those which are primitive roots of $x^{\omega} - 1 = 0$. For example, if

$$\chi(n) = \left(\frac{21}{n}\right), \qquad 2\omega = 42,$$

$$g(x) = x + x^{5} - x^{11} - x^{13} + x^{17} - x^{19} - x^{23} + x^{25} - x^{29} - x^{31} + x^{37} + x^{41}$$
$$= x\,(x^{6} - 1)(x^{14} - 1)(1 + x^{4} + x^{6} + x^{14} + x^{16} + x^{20}).$$

The equation for the non-primitive roots of $x^{42} - 1 = 0$ is

$$\Phi(x) = \frac{(x^{21} - 1)(x^{7} + 1)(x^{3} + 1)}{x + 1} = 0.$$

The equation for the primitive roots of $x^{21} - 1 = 0$ is

$$F(x) = \frac{(x^{21} - 1)(x - 1)}{(x^{7} - 1)(x^{3} - 1)} = 0$$

and we have

$$\frac{\Phi(x)}{F(x)} = \frac{(x^{14} - 1)(x^{6} - 1)}{x^{2} - 1},$$

so that

$$g(x) = \frac{\Phi(x)}{F(x)}\,q(x),$$

$q(x)$ being a polynomial.

By virtue of these two propositions, and remembering that $\chi(m)$ is zero when $(m, 2\omega) > 1$, we see that if $2\omega$ is divisible by 4 or by 8,

(24) $$g\!\left(\exp\frac{2\pi i m}{2\omega}\right) = \chi(m)\,g\!\left(\exp\frac{2\pi i}{2\omega}\right)$$

whatever the integer $m$, and if $2\omega$ is twice an odd number, equation (24) still holds, except for the $\varphi(2\omega)$ even values of $m$ which are prime to $\omega$. For these values $m = a + \omega$, where $(a, 2\omega) = 1$, we have

(25) $$g\!\left(\exp\frac{2\pi i\,(a + \omega)}{2\omega}\right) = -\chi(a)\,g\!\left(\exp\frac{2\pi i}{2\omega}\right).$$

We do not know whether these two propositions A and B have been stated explicitly. As we said at the outset, they follow from the calculations of Dirichlet (2). The same is true, though less easily, of the multiplicative properties which follow. We shall therefore indicate very briefly the relations between characters which allow them to be established.
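Proposition A and the relation $g(\rho^{a}) = \chi(a)\,g(\rho)$ can be tested numerically on the example $\chi(n) = (15/n)$, $2\omega = 60$. The sketch below is an illustration, not part of the paper: `jacobi` is the standard binary algorithm for the Jacobi symbol with odd positive lower argument, and the evaluation point `t = 0.7` used to compare $g$ with its factored form is an arbitrary choice.

```python
import cmath
from math import gcd

def jacobi(a, n):
    """Jacobi symbol (a/n) for odd n > 0."""
    a %= n
    result = 1
    while a:
        while a % 2 == 0:
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0

D, PERIOD = 15, 60                          # chi(n) = (15/n), 2*omega = 60

def chi(n):
    return jacobi(D, n) if n % 2 else 0

def g(x):
    return sum(chi(mu) * x ** mu for mu in range(1, PERIOD) if gcd(mu, PERIOD) == 1)

def root(m):
    return cmath.exp(2j * cmath.pi * m / PERIOD)

# Proposition A: g vanishes at every non-primitive 60th root of unity.
worst = max(abs(g(root(m))) for m in range(PERIOD) if gcd(m, PERIOD) > 1)

# g(rho**a) = chi(a) * g(rho) for a prime to 60.
defect = max(abs(g(root(a)) - chi(a) * g(root(1)))
             for a in range(1, PERIOD) if gcd(a, PERIOD) == 1)

# Factored form quoted in the example, compared at an arbitrary real point.
t = 0.7
factored = t * (1 - t**12) * (1 + t**6) * (1 + t**10) * (1 - t**30)
print(worst, defect, g(t) - factored)
```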


7. Multiplicative properties of $g\!\left(\exp\dfrac{2\pi i m}{2\omega}\right)$.

Recall first that, for Ramanujan's sums

(26) $$c_q(m) = \sum_{(s,q)=1} \exp\!\left(\frac{2\pi i\,m s}{q}\right)$$

we have, when $(q, q') = 1$,

$$c_q(m)\,c_{q'}(m) = c_{qq'}(m).$$

The properties which we are about to state comprise five different cases. Put

$$\chi(n) = \left(\frac{D}{n}\right), \qquad \chi'(n) = \left(\frac{D'}{n}\right), \qquad \chi_1(n) = \left(\frac{DD'}{n}\right);$$

let $2\omega$, $2\omega'$ and $2\omega_1$ be the smallest periods of $\chi(n)$, $\chi'(n)$ and $\chi_1(n)$ respectively.

$h$, $h'$, $h_1$ denoting respectively the integers prime to $2\omega$, $2\omega'$, $2\omega_1$ and $< 2\omega$, $2\omega'$, $2\omega_1$, we put

$$g(x) = \sum \chi(h)\,x^{h}, \qquad g'(x) = \sum \chi'(h')\,x^{h'}, \qquad g_1(x) = \sum \chi_1(h_1)\,x^{h_1};$$

we put further

$$\rho = \exp\!\left(\frac{2\pi i}{2\omega}\right), \qquad \rho' = \exp\!\left(\frac{2\pi i}{2\omega'}\right), \qquad \rho_1 = \exp\!\left(\frac{2\pi i}{2\omega_1}\right).$$

Finally $P$ and $P'$ will be two positive odd integers, relatively prime, and $m$ is any positive integer.


1° Cas: 
a D = 2P, D=P'=3 (mod 4) 
wa. 
g(p”)g’ (p"”) = 2g:(p."). 
La démonstration repose sur 
(a) i tats 
Ph+2Ph') > \h/\i'/* 
- gieme Cas: D = 2P, D’ = P’ = 1 (mod 4) 
“ m , sm 2 m 
g(p' )g’(p"") = — P g1(p1). 
e 


La démonstration repose sur 
(ais) ~~ (BENE) 
P’h + 4Ph' P’}\h/\i' 7° 
3'me Cas: D = P = 1 (mod 4), D’ = P’ = 3(mod 4) 
9 
g(e”)g’(o"") = — (2) gi(p:). 
La démonstration repose sur 
(ae aes) - - GIG) 
Ph’ + 2P"h P/\h/\h' 
4%me Cas: D = P = 1 (mod 4), D’ = P’ = 1 (mod 4). 


g(o™)g’(p'™) = e”*gi(or). 













La démonstration repose sur ce que, si 


hy, = Ph'+ P'h+ PP’, 


(F-) - GF). 


5'me Cas: D = P = 3 (mod 4), D’ = P’ = 3 (mod 4) 


ona 


g(p”)g’(p") = — (3,) 2(1 — e”**) gi (pr). 


La démonstration repose sur ce que: d’une part, si h; = (P’h + Ph’)/2 est 


hy PP h h 


d’autre part, si h; = (P’h + Ph’)/2 est pair, 


(or) - NEVE 
hit PP’) \PP’/\h/\i'/] 


8. Il résulte, en définitive, des équations (24) et (25) que la connaissance de 
g (exp(xim/w)) est ramenée a celle de g (exp(i/w)). Celle-ci exige la connais- 
sance des sommes de Gauss. Les calculs de Dirichlet (2, pp. 188-93) montrent 
que si 








x(n) = (2) : P = 1 (mod 4); 2.0 = 2P, 


Qn 2\ = 
s\exP\5p )) = — \p) VP: 


si 


si 










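9. We return to the series (1) of § 1, supposing that $\chi(n)$ is the Jacobi symbol of § 5, having the period $2\omega$. We then have

$$A(x) = \frac{g(x)}{1 - x^{2\omega}},$$

where $g(x)$ is the polynomial (23). The series (2) becomes

(27) $$G(x,y) = \sum_{k=1}^{\infty} a_k\,\frac{g(x^{k})}{1 - x^{2\omega k}}\,\frac{g(y^{k})}{1 - y^{2\omega k}}$$

and formula (5) remains valid. Let us find the coefficient of $\chi(m)\,y^{m}$ on the right-hand side of (4), without concerning ourselves with the value, zero or non-zero, of $\chi(m)$. It is

$$\sum_{(l,m)=1} \chi(l)\,x^{l}.$$

Since $\chi(l)$ has the period $2\omega$ and $l$ must be prime to $m$, we have

$$\sum_{(l,m)=1} \chi(l)\,x^{l} = \frac{f(x, 2m\omega)}{1 - x^{2m\omega}}$$

where

(28) $$f(x, 2m\omega) = \sum_{(\mu, m)=1} \chi(\mu)\,x^{\mu},$$

or, if one prefers, by the properties of $\chi(n)$,

(29) $$f(x, 2m\omega) = \sum_{(\mu, 2m\omega)=1} \chi(\mu)\,x^{\mu}, \qquad 1 \le \mu \le 2m\omega - 1.$$

We see that $g(x) = f(x, 2\omega)$. Thus

(30) $$\psi(x,y) = \sum_{q=1}^{\infty} \chi(q)\,y^{q}\,\frac{f(x, 2q\omega)}{1 - x^{2q\omega}}.$$

Another expression for the polynomial $f(x, 2q\omega)$ is obtained in the following way. Identifying the terms in $a_1$ on the right-hand side of (27) and on the right-hand side of (5), we obtain

(31) $$\frac{g(x)}{1 - x^{2\omega}}\,\frac{g(y)}{1 - y^{2\omega}} = \sum_{n=1}^{\infty} \chi^{2}(n)\,\psi(x^{n}, y^{n}).$$

From this one deduces, by inversion, $\mu(n)$ denoting the Möbius function,

(32) $$\psi(x,y) = \sum_{k=1}^{\infty} \mu(k)\,\chi^{2}(k)\,\frac{g(x^{k})}{1 - x^{2\omega k}}\,\frac{g(y^{k})}{1 - y^{2\omega k}}.$$

The term in $y^{q}$ on the right-hand side of (32) is

$$y^{q} \sum_{d \mid q} \mu(d)\,\chi^{2}(d)\,\chi\!\left(\frac{q}{d}\right)\frac{g(x^{d})}{1 - x^{2d\omega}}$$

and, comparing with (30), we find

(33) $$\chi(q)\,f(x, 2q\omega) = \sum_{d \mid q} \mu(d)\,\chi^{2}(d)\,\chi\!\left(\frac{q}{d}\right)g(x^{d})\,\frac{1 - x^{2q\omega}}{1 - x^{2d\omega}}$$

and, since

$$\chi^{2}(d)\,\chi\!\left(\frac{q}{d}\right) = \chi(q)\,\chi(d),$$

we also have, on dividing both sides by $\chi(q)$,

(34) $$f(x, 2q\omega) = \sum_{d \mid q} \mu(d)\,\chi(d)\,g(x^{d})\,\frac{1 - x^{2q\omega}}{1 - x^{2d\omega}}.$$

This formula (34) remains valid even if $\chi(q)$ is zero, for if we put, for a moment,

$$h(x) = g(x)\,\frac{1 - x^{2q\omega}}{1 - x^{2\omega}},$$

the right-hand side of (34) is none other than the polynomial $h(x)$, deprived of the terms $\chi(\mu)\,x^{\mu}$ in which $\mu$ is not prime to $q$. It is therefore indeed the polynomial $f(x, 2q\omega)$.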
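Formula (34) is an identity between polynomials with integer coefficients, so it can be verified exactly for small $q$. The sketch below assumes the concrete character $\chi(n) = (3/n)$ with $2\omega = 12$, whose non-zero values on one period are hard-coded in `CHI`; `f_direct` and `f_formula` are hypothetical helper names returning coefficient lists indexed by the exponent.

```python
from math import gcd

PERIOD = 12                                # 2*omega for chi(n) = (3/n)
CHI = {1: 1, 5: -1, 7: -1, 11: 1}          # non-zero values on one period

def chi(n):
    return CHI.get(n % PERIOD, 0)

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def f_direct(q):
    """Coefficient list of f(x, 2*q*omega) from definition (29)."""
    size = q * PERIOD
    return [chi(mu) if gcd(mu, size) == 1 else 0 for mu in range(size)]

def f_formula(q):
    """Coefficient list of the right-hand side of (34)."""
    size = q * PERIOD
    coeffs = [0] * size
    for d in range(1, q + 1):
        if q % d:
            continue
        weight = mobius(d) * chi(d)
        # g(x^d) (1 - x^(2 q omega)) / (1 - x^(2 d omega)) = sum_{nu < size/d} chi(nu) x^(d nu)
        for nu in range(size // d):
            coeffs[d * nu] += weight * chi(nu)
    return coeffs

agree = all(f_direct(q) == f_formula(q) for q in range(1, 40))
print(agree)
```

Values of $q$ sharing a factor with $2\omega$ are included in the loop on purpose, since the text notes that (34) survives even when $\chi(q) = 0$.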


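10. Series analogous to a series of Ramanujan. We shall seek an expression for the sum of the three series

$$S = \sum_{q=1}^{\infty} f(x^{n/q}, 2q\omega)\,\frac{1}{q^{s}},$$

$$S_1 = \sum_{q=1}^{\infty} f(x^{n/q}, 2q\omega)\,\frac{\chi(q)}{q^{s}},$$

$$S_2 = \sum_{q=1}^{\infty} f(x^{n/q}, 2q\omega)\,\frac{\chi^{2}(q)}{q^{s}},$$

where $n$ denotes any positive integer, confining ourselves, for simplicity, to the cases in which the period $2\omega$ of $\chi(n)$ is divisible by 4 or by 8. In these cases formula (24) holds. In addition $X(s)$ and $X_2(s)$ will have the meanings (9) and (11) of § 2.

In formula (34), replace $x$ by $x^{n/q}$; it becomes

(35) $$f(x^{n/q}, 2q\omega) = \sum_{d \mid q} \mu\!\left(\frac{q}{d}\right)\chi\!\left(\frac{q}{d}\right)g(x^{n/d})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/d}}$$

and, substituting this expression in $S$, we see that the right-hand side is the product of

$$\sum_{q=1}^{\infty} \frac{\mu(q)\,\chi(q)}{q^{s}} = \frac{1}{X(s)}$$

by

$$\sum_{q=1}^{\infty} g(x^{n/q})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/q}}\,\frac{1}{q^{s}}.$$

We therefore have

(36) $$S = \frac{1}{X(s)} \sum_{q=1}^{\infty} g(x^{n/q})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/q}}\,\frac{1}{q^{s}}.$$

Similarly, replacing $x$ by $x^{n/q}$ in formula (33) and substituting in $S_1$, we have

(37) $$S_1 = \frac{1}{X_2(s)} \sum_{q=1}^{\infty} \chi(q)\,g(x^{n/q})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/q}}\,\frac{1}{q^{s}}.$$

Finally, we deduce from (35)

$$\chi^{2}(q)\,f(x^{n/q}, 2q\omega) = \sum_{d \mid q} \mu\!\left(\frac{q}{d}\right)\chi\!\left(\frac{q}{d}\right)\chi^{2}(d)\,g(x^{n/d})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/d}}$$

and

(38) $$S_2 = \frac{1}{X(s)} \sum_{q=1}^{\infty} \chi^{2}(q)\,g(x^{n/q})\,\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/q}}\,\frac{1}{q^{s}}.$$

Now put $x = \exp(2\pi i/2\omega)$ in the three equations (36), (37), and (38); the factor

$$\frac{1 - x^{2n\omega}}{1 - x^{2n\omega/q}}$$

is zero if $q$ does not divide $n$ and is equal to $q$ if $q$ divides $n$. We therefore have

(39) $$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right)\frac{1}{q^{s}} = \frac{g\!\left(\exp\dfrac{2\pi i}{2\omega}\right)}{n^{s-1}\,X(s)} \sum_{d \mid n} \chi(d)\,d^{s-1},$$

(40) $$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right)\frac{\chi(q)}{q^{s}} = \frac{\chi(n)\,g\!\left(\exp\dfrac{2\pi i}{2\omega}\right)}{n^{s-1}\,X_2(s)} \sum_{d \mid n} d^{s-1},$$

(41) $$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right)\frac{\chi^{2}(q)}{q^{s}} = \frac{\chi(n)\,g\!\left(\exp\dfrac{2\pi i}{2\omega}\right)}{n^{s-1}\,X(s)} \sum_{d \mid n} \chi\!\left(\frac{n}{d}\right)d^{s-1}.$$

It is these last three formulas which may be regarded as analogous to Ramanujan's formula (6, pp. 55, 237)

$$\sum_{q=1}^{\infty} \frac{c_q(n)}{q^{s}} = \frac{1}{n^{s-1}\,\zeta(s)} \sum_{d \mid n} d^{s-1}.$$

If we suppose the integer $n$ prime to $2\omega$, its divisors $d$ are so too and $\chi(n/d) = \chi(n)\,\chi(d)$, so that the right-hand sides of (39) and (41) are the same and, consequently, the left-hand sides are equal. This can be proved directly. It should be observed that, when $n \cdot q$ is prime to $2\omega$, the sum

$$f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right)$$

is closely related to Ramanujan's sum (26), whose arithmetical expression is (6, p. 237)

(42) $$c_q(n) = \sum_{d \mid q,\ d \mid n} \mu\!\left(\frac{q}{d}\right)d.$$

Indeed, take formula (35) again and put

$$x = \exp\!\left(\frac{2\pi i}{2\omega}\right);$$

we obtain

(43) $$f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right) = g\!\left(\exp\frac{2\pi i}{2\omega}\right) \sum_{d \mid q,\ d \mid n} \mu\!\left(\frac{q}{d}\right)\chi\!\left(\frac{q}{d}\right)\chi\!\left(\frac{n}{d}\right)d.$$

Now

$$\chi\!\left(\frac{q}{d}\right)\chi\!\left(\frac{n}{d}\right) = \frac{\chi(qn)}{\chi^{2}(d)} = \chi(qn),$$

since $\chi(d)$ is not zero, hence

(44) $$f\!\left(\exp\frac{2\pi i n}{2q\omega},\, 2q\omega\right) = g\!\left(\exp\frac{2\pi i}{2\omega}\right)\chi(q \cdot n) \sum_{d \mid q,\ d \mid n} \mu\!\left(\frac{q}{d}\right)d = g\!\left(\exp\frac{2\pi i}{2\omega}\right)\chi(q \cdot n)\,c_q(n).$$

Formula (44) would still hold if only one of the numbers $q$ and $n$ had a common divisor with $2\omega$, but its right-hand side would be zero. If, on the contrary, $q$ and $n$ each had a common divisor with $2\omega$, formula (44) would no longer hold.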
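The link (44) between $f$ and Ramanujan's sum can be checked numerically for a character whose period is divisible by 4. The sketch below assumes $\chi(n) = (3/n)$, $2\omega = 12$, and a handful of arbitrarily chosen pairs $(q, n)$ with $qn$ prime to 12; `f_at` and `ramanujan_sum` are hypothetical helper names that evaluate the defining sums (29) and (26) directly.

```python
import cmath
from math import gcd

PERIOD = 12                                # 2*omega for chi(n) = (3/n)
CHI = {1: 1, 5: -1, 7: -1, 11: 1}

def chi(n):
    return CHI.get(n % PERIOD, 0)

def f_at(q, n):
    """f(exp(2*pi*i*n / (2*q*omega)), 2*q*omega), summed from definition (29)."""
    size = q * PERIOD
    z = cmath.exp(2j * cmath.pi * n / size)
    return sum(chi(mu) * z ** mu for mu in range(1, size) if gcd(mu, size) == 1)

def ramanujan_sum(q, n):
    """c_q(n) from its definition (26); the value is an integer."""
    total = sum(cmath.exp(2j * cmath.pi * n * s / q)
                for s in range(1, q + 1) if gcd(s, q) == 1)
    return round(total.real)

g_rho = f_at(1, 1)                         # g(exp(2*pi*i / (2*omega)))

pairs = [(q, n) for q in (1, 5, 7, 25, 35) for n in (1, 5, 7, 35, 49)]
defect = max(abs(f_at(q, n) - g_rho * chi(q * n) * ramanujan_sum(q, n))
             for q, n in pairs)
print(g_rho, defect)
```

For this character $g(\rho) = 2(\rho - \rho^{5}) = 2\sqrt{3}$, which gives an independent check on `g_rho`.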


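11. Example. Take

$$\chi(n) = \left(\frac{-1}{n}\right).$$

Here $D = -1$ is negative, which we had excluded previously, notably in § 8, but one easily has

$$X(s) = 1 - \frac{1}{3^{s}} + \frac{1}{5^{s}} - \cdots = L(s),$$

(45) $$X_2(s) = 1 + \frac{1}{3^{s}} + \frac{1}{5^{s}} + \frac{1}{7^{s}} + \cdots = \left(1 - 2^{-s}\right)\zeta(s);$$

the period of $\chi(n)$ is $2\omega = 4$,

$$g(x) = x - x^{3}, \qquad g\!\left(\exp\frac{2\pi i}{4}\right) = 2i, \qquad g\!\left(\exp\frac{2\pi i n}{4}\right) = 2i\,\chi(n),$$

$$f(x, 4q) = \sum_{(\mu, 4q)=1} \chi(\mu)\,x^{\mu}, \qquad \mu < 4q,$$

and the application of formulas (39) to (41) gives

(46) $$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{4q},\, 4q\right)\frac{1}{q^{s}} = \frac{2i}{n^{s-1}\,L(s)} \sum_{d \mid n} \chi(d)\,d^{s-1},$$

$$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{4q},\, 4q\right)\frac{\chi(q)}{q^{s}} = \frac{2i\,\chi(n)}{\left(1 - 2^{-s}\right)\zeta(s)\,n^{s-1}} \sum_{d \mid n} d^{s-1},$$

$$\sum_{q=1}^{\infty} f\!\left(\exp\frac{2\pi i n}{4q},\, 4q\right)\frac{\chi^{2}(q)}{q^{s}} = \frac{2i\,\chi(n)}{n^{s-1}\,L(s)} \sum_{d \mid n} \chi\!\left(\frac{n}{d}\right)d^{s-1}.$$

We see from (46) that, if the odd prime factors of $n$ are all $\equiv 1 \pmod 4$, the same is true of all the odd divisors of $n$ and, in this case, the right-hand side of (46) reduces to

$$\frac{2i\,\sigma_{s-1}(n)}{n^{s-1}\,L(s)}.$$

Finally, by (44), if $n\,q$ is odd,

$$f\!\left(\exp\frac{2\pi i n}{4q},\, 4q\right) = 2i\,\chi(n \cdot q)\,c_q(n).$$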


12. We point out that a generalization of the results of §§ 9 and 10 can be obtained by starting from formula (7), which we rewrite

$$\psi(x,y,z) = \sum_{(l,m,n)=1} \chi(l)\,\chi(m)\,\chi(n)\,x^{l}y^{m}z^{n},$$

and from equation (8), from which we derive, as before, by inversion,

(47) $$\psi(x,y,z) = \sum_{k=1}^{\infty} \mu(k)\,\chi(k)\,\frac{g(x^{k})}{1 - x^{2\omega k}}\,\frac{g(y^{k})}{1 - y^{2\omega k}}\,\frac{g(z^{k})}{1 - z^{2\omega k}}.$$

Formula (47) lends itself to the expansion of the left-hand side in powers of one of the variables, $z$ for example, and formula (7) lends itself to the expansion in terms of the functions $\psi(x,y)$, $\psi(x^{2}, y^{2})$, ..., $\psi(x^{n}, y^{n})$, .... One can thus find sums and series analogous to those of Ramanujan.


13. Definite integrals. Schlömilch (11, p. 277), by transforming an integral of Abel, gave the formula

(48) $$\int_0^{\infty} \frac{u^{x-1}\,du}{(l + mu)^{x+y}} = \frac{\Gamma(x)\,\Gamma(y)}{\Gamma(x+y)}\,\frac{1}{l^{y}\,m^{x}}, \qquad \Re(x) > 1,\ \Re(y) > 1.$$

Put

(49) $$P(u, \sigma) = \sum_{(l,m)=1} \frac{1}{(l + mu)^{\sigma}};$$

we have

(50) $$\int_0^{\infty} u^{s-1}\,P(u, s+s')\,du = \frac{\Gamma(s)\,\Gamma(s')}{\Gamma(s+s')}\,\frac{\zeta(s)\,\zeta(s')}{\zeta(s+s')}.$$

If we multiply both sides of (49) by $\zeta(\sigma)$, we obtain

(51) $$\zeta(\sigma)\,P(u, \sigma) = Q(u, \sigma) = \sum_{l=1}^{\infty} \sum_{m=1}^{\infty} \frac{1}{(l + mu)^{\sigma}}$$

and consequently

(52) $$\int_0^{\infty} u^{s-1}\,Q(u, s+s')\,du = \frac{\Gamma(s)\,\Gamma(s')}{\Gamma(s+s')}\,\zeta(s)\,\zeta(s').$$

It seems that formula (50) may be of some interest. When $s$ and $s'$ are even integers, the integral on the left-hand side is a rational number, and one may recall that Euler (5, p. 350) sought expressions such as:

$$\zeta(3) = \frac{2^{2}\,B_{3/2}\,\pi^{3}}{3!}, \qquad B_{3/2} = 0.05815227\ldots$$

$$\zeta(5) = \frac{2^{4}\,B_{5/2}\,\pi^{5}}{5!}, \qquad B_{5/2} = 0.02541327\ldots$$

Let us now start from an integral which is easily deduced from a formula of Liouville (3, p. 150):

(53) $$\int_0^{\infty}\!\!\int_0^{\infty} \frac{u^{s-1}\,v^{s'-1}\,du\,dv}{(l + mu + nv)^{s+s'+s''}} = \frac{\Gamma(s)\,\Gamma(s')\,\Gamma(s'')}{\Gamma(s+s'+s'')}\,\frac{1}{l^{s''}\,m^{s}\,n^{s'}},$$

where the real parts of $s$, $s'$, $s''$ are $> 1$, and put

(54) $$P(u, v, \sigma) = \sum_{(l,m,n)=1} \frac{1}{(l + mu + nv)^{\sigma}}, \qquad \Re(\sigma) > 3;$$

we have

(55) $$\int_0^{\infty}\!\!\int_0^{\infty} u^{s-1}\,v^{s'-1}\,P(u, v, s+s'+s'')\,du\,dv = \frac{\Gamma(s)\,\Gamma(s')\,\Gamma(s'')}{\Gamma(s+s'+s'')}\,\frac{\zeta(s)\,\zeta(s')\,\zeta(s'')}{\zeta(s+s'+s'')}.$$

We put further

(56) $$Q(u, v, \sigma) = \zeta(\sigma)\,P(u, v, \sigma) = \sum_{l=1}^{\infty} \sum_{m=1}^{\infty} \sum_{n=1}^{\infty} \frac{1}{(l + mu + nv)^{\sigma}},$$

and, multiplying both sides of (55) by $\zeta(s+s'+s'')$, we obtain a new formula which it is unnecessary to write down.
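Formula (48), on which the integrals of this section rest, can be checked by elementary quadrature. The following is a rough sketch for real parameters only: the sample values $x = 2$, $y = 3$, $l = 2$, $m = 5$ are arbitrary, the substitution $u = t/(1-t)$ is a convenience chosen here to obtain a finite interval, and the function names are hypothetical.

```python
from math import gamma

def schlomilch_integral(x, y, l, m, steps=200_000):
    """Midpoint rule for  int_0^inf u^(x-1) / (l + m u)^(x+y) du.

    The substitution u = t / (1 - t) maps the range to 0 < t < 1 and turns
    the integrand into t^(x-1) (1-t)^(y-1) / (l (1-t) + m t)^(x+y).
    """
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * h
        total += t ** (x - 1) * (1 - t) ** (y - 1) / (l * (1 - t) + m * t) ** (x + y)
    return total * h

def closed_form(x, y, l, m):
    """Right-hand side of (48)."""
    return gamma(x) * gamma(y) / gamma(x + y) / (l ** y * m ** x)

numeric = schlomilch_integral(2, 3, 2, 5)
exact = closed_form(2, 3, 2, 5)          # Gamma(2) Gamma(3) / Gamma(5) / (2**3 * 5**2)
print(numeric, exact)
```

Summing the closed form over relatively prime $l$, $m$ is then exactly the step that leads from (48) and (13) to (50).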


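14. Supposing that $\chi(n)$ is a real character and putting, as in § 2,

$$X(s) = \sum_{n=1}^{\infty} \chi(n)\,n^{-s}, \qquad X_2(s) = \sum_{n=1}^{\infty} \chi^{2}(n)\,n^{-s}, \qquad X_3(s) = X(s),$$

the preceding calculations extend easily. Let

$$P(\chi, u, \sigma) = \sum_{(l,m)=1} \frac{\chi(lm)}{(l + mu)^{\sigma}}, \qquad \Re(\sigma) > 2,$$

$$P(\chi, u, v, \sigma) = \sum_{(l,m,n)=1} \frac{\chi(lmn)}{(l + mu + nv)^{\sigma}}, \qquad \Re(\sigma) > 3,$$

and let also

$$X_2(\sigma)\,P(\chi, u, \sigma) = Q(\chi, u, \sigma) = \sum_{l=1}^{\infty} \sum_{m=1}^{\infty} \frac{\chi(lm)}{(l + mu)^{\sigma}},$$

$$X(\sigma)\,P(\chi, u, v, \sigma) = Q(\chi, u, v, \sigma) = \sum_{l=1}^{\infty} \sum_{m=1}^{\infty} \sum_{n=1}^{\infty} \frac{\chi(lmn)}{(l + mu + nv)^{\sigma}};$$

starting from the integrals (48) and (53), we have

$$\int_0^{\infty} u^{s-1}\,P(\chi, u, s+s')\,du = \frac{\Gamma(s)\,\Gamma(s')}{\Gamma(s+s')}\,\frac{X(s)\,X(s')}{X_2(s+s')},$$

$$\int_0^{\infty}\!\!\int_0^{\infty} u^{s-1}\,v^{s'-1}\,P(\chi, u, v, s+s'+s'')\,du\,dv = \frac{\Gamma(s)\,\Gamma(s')\,\Gamma(s'')}{\Gamma(s+s'+s'')}\,\frac{X(s)\,X(s')\,X(s'')}{X(s+s'+s'')},$$

$$\int_0^{\infty} u^{s-1}\,Q(\chi, u, s+s')\,du = \frac{\Gamma(s)\,\Gamma(s')}{\Gamma(s+s')}\,X(s)\,X(s'),$$

$$\int_0^{\infty}\!\!\int_0^{\infty} u^{s-1}\,v^{s'-1}\,Q(\chi, u, v, s+s'+s'')\,du\,dv = \frac{\Gamma(s)\,\Gamma(s')\,\Gamma(s'')}{\Gamma(s+s'+s'')}\,X(s)\,X(s')\,X(s'').$$

Take in particular the same character $\chi(n)$ as in § 11; we have

(57) $$\int_0^{\infty} u^{s-1}\,P(\chi, u, s+s')\,du = \frac{\Gamma(s)\,\Gamma(s')}{\Gamma(s+s')}\,\frac{L(s)\,L(s')}{\left(1 - 2^{-s-s'}\right)\zeta(s+s')},$$

$$\int_0^{\infty}\!\!\int_0^{\infty} u^{s-1}\,v^{s'-1}\,P(\chi, u, v, s+s'+s'')\,du\,dv = \frac{\Gamma(s)\,\Gamma(s')\,\Gamma(s'')}{\Gamma(s+s'+s'')}\,\frac{L(s)\,L(s')\,L(s'')}{L(s+s'+s'')}.$$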

15. Here are some examples of the application of the integral (50), which we denote by $I(s, s')$, and of the integral (57), which we denote by $J(s, s')$. It is known (9, p. 33) that

$$\zeta(2n) = \frac{1}{2}\,(2\pi)^{2n}\,\frac{B_n}{(2n)!},$$

$$L(2n+1) = \frac{1}{2}\left(\frac{\pi}{2}\right)^{2n+1}\frac{E_n}{(2n)!};$$

$B_1 = 1/6$, $B_2 = 1/30$, ... are the Bernoulli numbers, and $E_0 = 1$, $E_1 = 1$, $E_2 = 5$, ... are the Euler numbers. Their values are to be found in (10, pp. 176-8).

$$I(2,2) = \frac{5}{12}, \qquad J(1,3) = \frac{1}{4},$$

$$I(2,4) = \frac{7}{80}, \qquad J(1,5) = \frac{5}{32},$$

$$I(3,3) = \frac{3^{2} \cdot 7}{2}\,\pi^{-6}\,\zeta^{2}(3), \qquad J(3,3) = \frac{1}{32},$$

$$I(2,6) = \frac{5}{2 \cdot 3^{2} \cdot 7}, \qquad J(1,7) = \frac{61}{2^{5} \cdot 17},$$

$$I(3,5) = 2 \cdot 3^{2} \cdot 5\,\pi^{-8}\,\zeta(3)\,\zeta(5), \qquad J(3,5) = \frac{5}{2^{5} \cdot 17},$$

$$I(5,5) = \frac{3^{3} \cdot 11}{2}\,\pi^{-10}\,\zeta^{2}(5), \qquad J(5,5) = \frac{5^{2}}{2^{9} \cdot 31}.$$

We see that algebraic relations between these integrals can be obtained.
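The rational entries of this table follow mechanically from (50), (57) and the closed forms for $\zeta(2n)$ and $L(2n+1)$. The sketch below reproduces them with exact fractions; it only covers $I(s,s')$ for even arguments and $J(s,s')$ for odd arguments, where the powers of $\pi$ cancel, and it hard-codes the first few Bernoulli and Euler numbers in the indexing used in this section. All function names are hypothetical.

```python
from fractions import Fraction
from math import factorial

BERNOULLI = [None, Fraction(1, 6), Fraction(1, 30), Fraction(1, 42),
             Fraction(1, 30), Fraction(5, 66)]      # B_1 .. B_5
EULER = [1, 1, 5, 61]                                # E_0 .. E_3

def zeta_over_pi(k):
    """zeta(k) / pi**k for even k, from zeta(2n) = (2 pi)^(2n) B_n / (2 (2n)!)."""
    n = k // 2
    return Fraction(2 ** (k - 1)) * BERNOULLI[n] / factorial(k)

def L_over_pi(k):
    """L(k) / pi**k for odd k, from L(2n+1) = (pi/2)^(2n+1) E_n / (2 (2n)!)."""
    n = (k - 1) // 2
    return Fraction(EULER[n], 2 ** (k + 1) * factorial(2 * n))

def beta(s, t):
    """Gamma(s) Gamma(t) / Gamma(s + t) for positive integers."""
    return Fraction(factorial(s - 1) * factorial(t - 1), factorial(s + t - 1))

def I(s, t):   # both even: the powers of pi cancel
    return beta(s, t) * zeta_over_pi(s) * zeta_over_pi(t) / zeta_over_pi(s + t)

def J(s, t):   # both odd: the powers of pi cancel
    x2 = (1 - Fraction(1, 2 ** (s + t))) * zeta_over_pi(s + t)
    return beta(s, t) * L_over_pi(s) * L_over_pi(t) / x2

print(I(2, 2), I(2, 4), I(2, 6))
print(J(1, 3), J(1, 5), J(3, 3), J(1, 7), J(3, 5), J(5, 5))
```

For instance $J(1,7) = J(3,5) \cdot 61/5$, one of the algebraic relations alluded to above.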


16. Consider the integral (52). It is not evident that it has a meaning for, in the case of the function $Q(u, \sigma)$, the negative real axis is a line of essential singular points, including the point $u = 0$, and one sees directly that, as $u$ tends to zero through positive values, $Q(u, \sigma)$ becomes infinite. But one easily establishes the formula

(58) $$\Gamma(\sigma)\,Q(u, \sigma) = \int_0^{\infty} \frac{x^{\sigma-1}\,dx}{(e^{x} - 1)(e^{ux} - 1)}, \qquad \Re(\sigma) > 2.$$

Supposing $u$ very small and positive, we have

$$\frac{1}{e^{ux} - 1} \sim \frac{1}{ux}$$

and

$$\Gamma(\sigma)\,Q(u, \sigma) \sim \frac{1}{u}\,\Gamma(\sigma - 1)\,\zeta(\sigma - 1)$$

or

$$Q(u, \sigma) \sim \frac{\zeta(\sigma - 1)}{\sigma - 1}\,\frac{1}{u}$$

and

$$P(u, \sigma) \sim \frac{1}{\sigma - 1}\,\frac{\zeta(\sigma - 1)}{\zeta(\sigma)}\,\frac{1}{u}.$$

One would arrive at the same result by taking, in (58), $u$ very large and positive and then using the relation

$$Q\!\left(\frac{1}{u}, \sigma\right) = u^{\sigma}\,Q(u, \sigma).$$

The integrals (50) and (52) therefore do have a meaning, since $\Re(s)$ and $\Re(s')$ are $> 1$. If we consider the integral (52), using (58) with $\sigma = s + s'$, we have

(59) $$\int_0^{\infty} u^{s-1}\,du \int_0^{\infty} \frac{x^{s+s'-1}\,dx}{(e^{x} - 1)(e^{ux} - 1)} = \Gamma(s)\,\zeta(s)\,\Gamma(s')\,\zeta(s').$$

Put $u = y/x$ and the left-hand side becomes

$$\int_0^{\infty} \frac{y^{s-1}\,dy}{e^{y} - 1} \int_0^{\infty} \frac{x^{s'-1}\,dx}{e^{x} - 1},$$

which constitutes a direct proof of (59), hence of (52) and, consequently, of (50).


REFERENCES

1. P. Bachmann, Zahlentheorie, Teil 1.
2. P. Bachmann, Zahlentheorie, Teil 2.
3. G. Brunel, Monographie de la fonction gamma.
4. E. Cahen, Théorie des nombres, T. 2.
5. L. Euler, Oeuvres, Series Prima, Vol. X.
6. G. H. Hardy and E. M. Wright, An introduction to the theory of numbers (2nd ed.; Oxford, 1954).
7. E. Landau, Primzahlen, vol. I.
8. E. Landau, Primzahlen, vol. II.
9. E. Lindelöf, Le Calcul des résidus.
10. N. Nielsen, Traité élémentaire des nombres de Bernoulli.
11. O. Schlömilch, Compendium der Höheren Analysis, vol. II.

Lausanne, Suisse


—— 











POWER SERIES REPRESENTING CERTAIN 
RATIONAL FUNCTIONS 


Z. A. MELZAK 


1. Let $\mathfrak{A}$ denote the set of functions of a complex variable $z$, regular at $z = 0$, and let $I$ denote the set of non-negative integers. For $f \in \mathfrak{A}$ put

$$f(z) = \sum_{n=0}^{\infty} f_n z^{n}, \qquad \sigma_f(z) = \sum_{n=0}^{\infty} \operatorname{sgn}|f_n|\,z^{n}, \qquad I_f = \{n \mid n \in I,\ f_n = 0\}.$$

For a given subset $\mathfrak{A}_0$ of $\mathfrak{A}$ there arises the problem of characterizing the admissible gap sets $I_f$ of functions $f$ in $\mathfrak{A}_0$. When $\mathfrak{A}_0$ is the set $R$ of rational functions a complete solution is given by the following theorem:

(A) Let $f \in R$ and let $I_f$ be infinite. Then there exist integers $L, L_1, L_2, \ldots, L_s$, such that $0 \le L_1 < L_2 < \cdots < L_s < L$, and $I_f = \{n \mid n \in I,\ n \equiv L_j \pmod L,\ j = 1, \ldots, s\} \cup I'$, where $I'$ is a finite exceptional set.

As in (2), this is simply deduced from the theorem

(B) Let $f \in R$ and let $I_f$ be infinite. Then there exist integers $L, L_1, n_0$, such that $0 \le L_1 < L$, $n_0 \ge 0$, and $\{n \mid n_0 \le n,\ n \equiv L_1 \pmod L\} \subset I_f$.

Theorem (A) was proved in 1934 by Mahler for the case when $f$ has algebraic coefficients. This was extended to the general case by Lech in 1953; later, in 1957, Mahler gave another proof of the general case. For references see (1) and (2).

We shall prove first

LEMMA 1. Theorem (A) is equivalent to the proposition: if $f \in R$ then $\sigma_f \in R$.

In view of this one may ask the following question: let

$$f = \sum_{n=0}^{\infty} f_n z^{n} \in R$$

and let the coefficients $f_n$ be all real; put

$$\chi_f(z) = \sum_{n=0}^{\infty} \operatorname{sgn} f_n\,z^{n};$$

under what conditions is $\chi_f \in R$? Our main result proves the existence of a large class of such functions $f$ and indicates some of its properties.


2. There are several descriptions of R which we shall use. Their well-known 
equivalence is stated formally as 


Received September 18, 1958. 


LEMMA 2. The following are equivalent:

(a) $R$ is the set of quotients $P(z)/Q(z)$ of polynomials with complex coefficients and with $Q(0) \ne 0$,

(b) $R$ is the set of sums of the form

$$P(z) + \sum_{k=1}^{N} \sum_{j=1}^{M} A_{jk}\,(\alpha_k - z)^{-j}$$

where $P$ is a polynomial, $A_{jk}$ and $\alpha_k$ are complex constants, and $\alpha_k \ne 0$,

(c) $R$ is the set of power series

$$\sum_{n=0}^{\infty} f_n z^{n},$$

regular at $z = 0$, whose coefficients satisfy a linear recurrence relation:

$$\sum_{j=0}^{N} c_j\,f_{n+j} = 0, \qquad n \ge n_0,$$

(d) $R$ is the set of power series

$$\sum_{n=0}^{\infty} f_n z^{n},$$

regular at $z = 0$, whose coefficients are values of an exponential polynomial:

$$f_n = \sum_{k=1}^{N} P_k(n)\,\alpha_k^{\,n}, \qquad n \ge n_0,$$

where $P_k$ is a polynomial and $\alpha_k \ne 0$.

Here and in the sequel "$T(n)$, $n \ge n_0$" will mean that the property $T$ holds for all non-negative integers greater than or equal to $n_0$. The bound $n_0$ will vary from case to case.

Let

$$f = \sum_{n=0}^{\infty} f_n z^{n}, \qquad g = \sum_{n=0}^{\infty} g_n z^{n},$$

and put

(1) $$f \circ g = \sum_{n=0}^{\infty} f_n g_n z^{n}.$$

By Hadamard's Multiplication Theorem (3),

(2) $$(f \circ g)(z) = \frac{1}{2\pi i} \int_C f(w)\,g(z/w)\,\frac{dw}{w}$$

where $C$ is a sufficiently small simple contour about the origin. By Lemma 2, (d), or directly by (2), $f \circ g \in R$ if $f, g \in R$. It follows that under the ordinary addition and the multiplication of (1) $R$ becomes a commutative algebra over the complex numbers, with the identity $e(z) = 1/(1 - z)$.











3. We prove now Lemma 1. Let

$$f = \sum_{n=0}^{\infty} f_n z^{n} \in R;$$

without loss of generality let $I_f$ be infinite. By Theorem (A)

$$\sigma_f(z) = e(z) - e(z^{L})\,P(z) + Q(z)$$

where $P$ and $Q$ are polynomials and

$$P(z) = \sum_{j=1}^{s} z^{L_j}.$$

Therefore $\sigma_f \in R$. Suppose now that $\sigma_f \in R$. By Lemma 2, (c)

$$\sum_{j=0}^{N} c_j \operatorname{sgn}|f_{n+j}| = 0, \qquad n \ge n_0.$$

However, there are exactly $2^{N}$ different sequences

$$\operatorname{sgn}|f_n|,\ \operatorname{sgn}|f_{n+1}|,\ \ldots,\ \operatorname{sgn}|f_{n+N-1}|.$$

It follows that the sequence $\{\operatorname{sgn}|f_n|\}$, $n = 0, 1, \ldots$, is periodic, $n \ge n_0$. Since

$$I_f = I_{\sigma_f},$$

this implies at once Theorem (B), and therefore also Theorem (A).


4. Let $f \in \mathfrak{R}$; by Lemma 2, (b) $f$ is a sum of a polynomial and a finite
number of partial fractions corresponding to the distinct poles $z = \alpha_k$, $k = 1, 2, \ldots, N$.
A pole at $\alpha_k$ will be called pseudo-rational if $\alpha_k/|\alpha_k|$ is a root of
unity, otherwise it will be called pseudo-irrational. We have now a unique
decomposition

$$f = P + f_1 + f_2 \tag{3}$$

where $P$ is a polynomial, all the poles of $f_1$ are pseudo-rational, and those of
$f_2$ are all pseudo-irrational. A function $f \in \mathfrak{R}$ is called itself pseudo-rational
if in its decomposition (3) $f_2 = 0$.

Let

$$f = \sum_{n=0}^{\infty} f_n z^n \in \mathfrak{R}, \qquad g = \sum_{n=0}^{\infty} g_n z^n \in \mathfrak{R},$$

and let $f_n$ and $g_n$ be real for all $n$. Put

$$f \cup g = \sum_{n=0}^{\infty} \max(f_n, g_n)\, z^n, \qquad f \cap g = \sum_{n=0}^{\infty} \min(f_n, g_n)\, z^n. \tag{4}$$


We can state now our principal result. 


THEOREM 1. Let

$$f = \sum_{n=0}^{\infty} f_n z^n \in \mathfrak{R}$$

and let $f_n$ be real for all $n$. If $f$ is pseudo-rational then $\chi_f \in \mathfrak{R}$. The set $\mathfrak{P}$ of all
pseudo-rational functions with real coefficients is a sub-algebra of $\mathfrak{R}$, over the
real numbers, under the ordinary addition and the multiplication of (1), and it
is also a lattice under the operations of (4).


5. We need first a preliminary

LEMMA 3. Let

$$E(n) = \sum_{k=1}^{N} P_k(n)\,\alpha_k^{\,n}$$

be an exponential polynomial, real for $n = 0, 1, \ldots$. Let the $\alpha_k$ be roots of
unity. Then $\{\operatorname{sgn} E(n)\}$, $n = 0, 1, \ldots$, is a periodic sequence, $n \ge n_0$, and

$$\min \{\, |E(n)| : E(n) \neq 0 \,\} \ge c > 0.$$

We have

$$E(n) = \sum_{k=1}^{N} \sum_{j=0}^{M} a_{kj}\, n^j \alpha_k^{\,n} \tag{5}$$

where $M = \max_k \deg P_k$; $M$ is called the degree of $E$. One can write (5) as

$$E(n) = \sum_{j=0}^{M} F_j(n)\, n^j \tag{6}$$

where

$$F_j(n) = \sum_{k=1}^{N} a_{kj}\,\alpha_k^{\,n}.$$

By the hypothesis $\alpha_k = \exp(2\pi i\, p_k/q_k)$, $0 \le p_k < q_k$, $(p_k, q_k) = 1$. Let $Q = \operatorname{l.c.m.}\{q_k\}$,
then $F_j(n) = F_j(n + Q)$ for all $n$ and $j$. We can also show that $F_j(n)$
is real for all $n$ and $j$; this follows by observing that with each pair $\alpha_k$, $P_k = \sum_j a_{kj} n^j$
in $E$ there is associated the conjugate pair $\bar{\alpha}_k$, $\bar{P}_k = \sum_j \bar{a}_{kj} n^j$.

The lemma will be proved by induction on the degree $M$ of $E$. Suppose first
that $M = 0$, then

$$E(n) = F_0(n) = \sum_{k=1}^{N} a_{k0}\,\alpha_k^{\,n}$$

so that $\{E(n)\}$, $n = 0, 1, \ldots$, is a periodic sequence of real numbers with
period $Q$. Therefore the lemma holds here. Suppose now that the lemma has
been established for exponential polynomials of degree $\le M$, and let $\deg E = M + 1$. Then

$$E(n) = F_{M+1}(n)\, n^{M+1} + E_1(n) \tag{7}$$

where $F_{M+1}(n)$ is real for all $n$ and not identically zero, and $\deg E_1 \le M$.
Let $Q$ be the common period of $F_0, F_1, \ldots, F_{M+1}$ and consider the set

$$S = \{ F_{M+1}(0),\ F_{M+1}(1),\ \ldots,\ F_{M+1}(Q - 1) \}.$$













If no member of $S$ vanishes then

$$\min_{n} |F_{M+1}(n)| = \min_{0 \le n \le Q-1} |F_{M+1}(n)| = c > 0, \tag{8}$$

and the first term on the right in (7) dominates the whole right-hand side
since $|E_1(n)| = O(n^M)$. Now the periodicity of $F_{M+1}(n)$ and the condition
(8) imply that the lemma holds in this case.

Suppose now that some members of $S$ vanish. For $n \in I$ let $n \in A$ if
$n \equiv m \pmod{Q}$ and $F_{M+1}(m) = 0$, $0 \le m < Q$; otherwise let $n \in B$. When
$n$ is restricted to $B$ the lemma holds as before; when $n \in A$, $E(n) = E_1(n)$
and the lemma holds by the induction assumption since $\deg E_1 \le M$. This
concludes the proof.
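
The statement of Lemma 3 can be watched at work on a small exponential polynomial. The following Python sketch is illustrative only: the particular E(n), the periodic tables `F1` and `F0`, and the value `n0 = 1` are assumptions chosen for the example, not data from the paper. Here E(n) = n(i^n + (-i)^n) - 3(-1)^n, whose bases i, -i, -1 are roots of unity, and the two coefficient sequences are written out as exact integer tables.

```python
from math import gcd

def sgn(x):
    return (x > 0) - (x < 0)

# E(n) = F_1(n) * n + F_0(n) with periodic, real-valued coefficients:
#   F_1(n) = i^n + (-i)^n   -> 2, 0, -2, 0   (period 4)
#   F_0(n) = -3 * (-1)^n    -> -3, 3         (period 2)
F1 = [2, 0, -2, 0]
F0 = [-3, 3]
Q = len(F1) * len(F0) // gcd(len(F1), len(F0))   # common period, here 4

def E(n):
    return F1[n % len(F1)] * n + F0[n % len(F0)]

signs = [sgn(E(n)) for n in range(10 * Q)]

# Periodicity is only claimed from some index n0 on (assumed value for this example).
n0 = 1
periodic_from_n0 = all(signs[n] == signs[n + Q] for n in range(n0, len(signs) - Q))
periodic_from_0 = all(signs[n] == signs[n + Q] for n in range(0, len(signs) - Q))

# Smallest non-zero |E(n)| over the sampled range: the constant c of the lemma.
min_nonzero = min(abs(E(n)) for n in range(len(signs)) if E(n) != 0)
```

In this example the sign at n = 0 differs from the sign at n = 4, so the sequence is periodic only for n at least 1, which is why the lemma is stated with a threshold.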


6. We prove now Theorem 1. Let $f = \sum_{n=0}^{\infty} f_n z^n$ be a pseudo-rational function
and let $f_n$ be real for all $n$. By Lemma 2, (b) we have

$$f(z) = P(z) + \sum_{r=1}^{R} \sum_{k=1}^{N} \sum_{j=1}^{M} A_{rkj} (\alpha_{rk} - z)^{-j} = P(z) + \sum_{r=1}^{R} g_r(z) \tag{9}$$

where $|\alpha_{rk}| = a_r$ and $0 < a_1 < a_2 < \ldots < a_R$. That is, we order the partial
fractions according to the increasing absolute value of the poles. $R$ will be
called the order of $f$. Since the presence of $P$ in (9) influences only a finite
number of coefficients we assume without loss of generality that $P = 0$.

We show first that $\chi_f \in \mathfrak{R}$. The proof will proceed by induction on the
order $R$ of $f$. Let $R = 1$, then $f = g_1(z)$ and so

$$f_n = a_1^{-n} E_1(n) \tag{10}$$

where $E_1(n)$ satisfies the conditions of Lemma 3. It follows that $\{\operatorname{sgn} E_1(n)\}$,
$n = 0, 1, \ldots$, is a periodic sequence, $n \ge n_0$, which implies immediately
that $\chi_f \in \mathfrak{R}$. Suppose now $\chi_f \in \mathfrak{R}$ for any function $f$ of order $\le R$, satisfying
the conditions. Let $f$ be a function of order $R + 1$, then

$$f(z) = g_1(z) + h(z)$$

where the order of $h$ is $\le R$ and the absolute value $a_1$ of the poles of $g_1$ is
less than that of any pole of $h$. Let

$$g_1(z) = \sum_{n=0}^{\infty} g_{n1} z^n, \qquad h(z) = \sum_{n=0}^{\infty} h_n z^n,$$

then $f_n = g_{n1} + h_n$. Suppose that $g_{n1} \neq 0$ for all $n$. By Lemma 3 it follows
easily that $h_n = o(g_{n1})$ for large $n$ and therefore $\operatorname{sgn} f_n = \operatorname{sgn} g_{n1}$, $n \ge n_0$.
However, by the induction assumption $\{\operatorname{sgn} g_{n1}\}$, $n = 0, 1, \ldots$, is a periodic
sequence, $n \ge n_0$. Hence $\{\operatorname{sgn} f_n\}$, $n = 0, 1, \ldots$, is a periodic sequence,
$n \ge n_0$, and $\chi_f \in \mathfrak{R}$.

Suppose now that $g_{n1} = 0$ for infinitely many $n$, and let $n \in A$ if $g_{n1} = 0$,
$n \in B$ otherwise. Much in the same way as in the proof of Lemma 3 we show
that $\{\operatorname{sgn} f_n\}$ is a periodic sequence when $n$ is restricted to $A$, and also when
$n$ is restricted to $B$, which again implies that $\chi_f \in \mathfrak{R}$.
Furthermore, it is easy to show that x, must have the following form 


xs(z) = P(z) + e(z) — e(2”)Q(z) 


where 


Qs) = D es’ 


and $\epsilon_s = 0$, $1$ or $-1$. It follows that not only $\chi_f \in \mathfrak{R}$ but actually $\chi_f \in \mathfrak{P}$.

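We proceed now with the rest of the proof. It is clear that $\mathfrak{P}$ is closed under
addition and multiplication by real numbers. We show next that $f \circ g \in \mathfrak{P}$
if $f, g \in \mathfrak{P}$. Although this follows immediately from Lemma 2, (d) the following
proof supplies a closed explicit representation for $f \circ g$. As in Lemma 2, (b)
let

$$f(z) = P(z) + \sum_{k=1}^{N} \sum_{j=1}^{M} A_{kj} (\alpha_k - z)^{-j},$$

$$g(z) = P_1(z) + \sum_{k_1=1}^{N_1} \sum_{j_1=1}^{M_1} B_{k_1 j_1} (\beta_{k_1} - z)^{-j_1},$$

then

$$f \circ g = Q(z) + \sum_{k=1}^{N} \sum_{j=1}^{M} \sum_{k_1=1}^{N_1} \sum_{j_1=1}^{M_1} A_{kj} B_{k_1 j_1}\, (\alpha_k - z)^{-j} \circ (\beta_{k_1} - z)^{-j_1} \tag{11}$$

where $Q$ is a polynomial. Now

$$(\alpha_k - z)^{-j} \circ (\beta_{k_1} - z)^{-j_1} = \sum_{n=0}^{\infty} \binom{n+j-1}{n} \binom{n+j_1-1}{n} \alpha_k^{-n-j} \beta_{k_1}^{-n-j_1} z^n. \tag{12}$$

Let constants $\gamma_{pqs}$, $s = 1, 2, \ldots, p + q - 1$, be determined so that

$$\binom{n+p-1}{n} \binom{n+q-1}{n} = \sum_{s=1}^{p+q-1} \gamma_{pqs} \binom{n+s-1}{n}$$

identically in $n$. Then by (12)

$$(\alpha_k - z)^{-j} \circ (\beta_{k_1} - z)^{-j_1} = \sum_{s=1}^{j+j_1-1} \gamma_{j j_1 s}\, \alpha_k^{\,s-j} \beta_{k_1}^{\,s-j_1} (\alpha_k \beta_{k_1} - z)^{-s}. \tag{13}$$

By putting together (11) and (13) we obtain an explicit representation of
$f \circ g$ and see at once that $f \circ g \in \mathfrak{P}$, since

$$\frac{\alpha_k \beta_{k_1}}{|\alpha_k \beta_{k_1}|} = \frac{\alpha_k}{|\alpha_k|} \cdot \frac{\beta_{k_1}}{|\beta_{k_1}|}.$$

By (4)

$$f \cup g = \tfrac{1}{2} \sum_{n=0}^{\infty} \left[ f_n + g_n + (f_n - g_n) \operatorname{sgn}(f_n - g_n) \right] z^n = \tfrac{1}{2} \left[ f + g + (f - g) \circ \chi_{f-g} \right],$$

and $f \cap g = f + g - f \cup g$. Since $f - g \in \mathfrak{P}$ implies $\chi_{f-g} \in \mathfrak{P}$, it follows
that if $f, g \in \mathfrak{P}$ then $f \cup g \in \mathfrak{P}$ and $f \cap g \in \mathfrak{P}$. This completes the proof.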
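
The reduction of the lattice operations of (4) to a termwise product with the sign series of f - g can be verified on coefficients. The Python sketch below is only an illustration under assumed data: the two coefficient sequences (those of 1/(1 - z) + 2/(1 + z) and of z/(1 - z)^2, both with poles on the unit circle at roots of unity) and the variable names are not from the paper.

```python
def sgn(x):
    return (x > 0) - (x < 0)

N_TERMS = 16
f = [1 + 2 * (-1) ** n for n in range(N_TERMS)]   # 1/(1 - z) + 2/(1 + z)
g = [n for n in range(N_TERMS)]                   # z/(1 - z)^2

# Lattice operations of (4), taken coefficient by coefficient.
join = [max(a, b) for a, b in zip(f, g)]
meet = [min(a, b) for a, b in zip(f, g)]

# Coefficients of the sign series of f - g.
chi = [sgn(a - b) for a, b in zip(f, g)]

# (1/2) [f + g + (f - g) o chi], with "o" the termwise product of (1).
# The bracket is always even for integer data, so integer division is exact.
join_formula = [(a + b + (a - b) * s) // 2 for a, b, s in zip(f, g, chi)]
meet_formula = [a + b - j for a, b, j in zip(f, g, join_formula)]
```

For these sequences the sign series `chi` is constant from n = 4 on, so it is again the coefficient sequence of a rational function, in line with the theorem.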


The author gratefully acknowledges the suggestions and criticism of Professor
K. Mahler of Manchester University.


REFERENCES 


1. C. Lech, Ark. Mat., 2 (1953), 417-21.
2. K. Mahler, Cambr. Phil. Soc., 52 (1956), 39-48.
3. E. C. Titchmarsh, The theory of functions (Oxford 1939).


McGill University 



















CERTAIN BILATERAL HYPERGEOMETRIC IDENTITIES 
OF CAYLEY AND ORR TYPE 


NIRMALA AGARWAL 


1. Recently I (1) gave some new basic hypergeometric identities of the
Cayley and Orr type with the help of a certain basic differential operator.
The present paper deals with some bilateral generalizations of those identities
together with certain new identities of the same type. In § 4 it is indicated how
the generalizations of Orr's identities given recently by Shukla (8, Theorems
I, II) may be connected with each other. Later in § 5 certain general expansions
of hypergeometric functions are deduced. The following notation has
been used throughout this paper:


” = (i a ’ oe 


(qg*;n) = (a;n) = (1-q)(l-¢ ), (a; 0) = 


(— \" qre/¢ "“(q i—a, 


;m), lqi <1 


(qd; 
(a) _» = (-)"/( — a)n 
ee (ai; n) (a2; nd . (a,; 2) x" 
e(entneoortg) | 
Da, bs, ..., 0, 24 (bi; 2) (bo; mm)... (b,; m) 
| < is \Dybe . ° . b,| < lay@o.. » AyX| < s. 


G1, d2,.--5Gr, = Fi (in(Ga)n --- Gr)n on deal os 
nf :) - z G).0)....6)°° 7! 


@;, @2,...@, — (a1; m) (a2; m) . . . (a,; m) 
ont =e fh ine | 1. ae x l 
re (2 bs, a) > (1; 2)(b:; m)(bo;m)...(b.;) > Ix] < 


i te oa le — (1)n(G2)n..- (Arn nn : 
3 -x) = et lel 
F(a Rc coe :) u ie... «| 


r(% = : = I'(a;) (a2) . . . T (Gm) 
= Saas T'(b;)0 (be) . (db, )’ 


. (° b )- (a)m(6)m 
c,d’ (chstehe 
(1 —q')(1 at ie Ce (1 aa ” ieee * 


where 6 = x 0/0x, and 


—n) = (a; —n) = 





where, for convergence, |x 














(6; n) 


Alb) = Sta? om HS) 











Received December 15, 1958. 








where $\Gamma_q(x)$ is a basic gamma function defined by Jackson (7) and

$$A(\lambda) = \frac{\Gamma(\theta + \lambda)}{\Gamma(\lambda)}.$$


2. Three bilateral hypergeometric identities. We now proceed to
prove the following three identities:


(2.1) (c;m)(d — b;m)(a + c — 2m; m) A,(c + m) 


d—b+~m; *) ( +c—m,b; — 
x w.(4 + m 2V2 1 rt ms, ¢ 


= the same expression with c and d interchanged. 


(2.2) (f — b;m)(d;m)(1 — a;m)(e — c;m) A,(d + m) 


f—b+m; ql (o b,e —c +m; xq° ‘) 
x wl’ +m vs 1 + m, d,e 


= (d — b; m)(f;m)(1 — c;m)(e — a;m) A,(f + m) 
d—b+m; "4 (< b,e —a +m; xq" ‘) 
x w( +m aVs 1 + m, e, f 
provided a + f =c+d. 


‘;m) , 
(2.3) Oe is — A,(c + m)A,(c’ + m) 


(c — a;m)(c’ — b’; m) 
a, b; xq° ~) (< b’; x ) 
x w(’ +m,c Ws 1+m,c’ 


—_ = _— a’;m)(1 — b a b’; m ) (a+b’)m , ? 
a. (1 — a’; m)(1 — b;m) q A,(a + a’) A,(b + 0’) 


’,c—a+m;x ) i — b'+m,a;x ” ineee F ") 
x w(* +m,b+ b'—m Ws l1+m,a+a’'—m 


provided $a + a' + b + b' = c + c' + 2m$.
The identities (2.1), (2.2), and (2.3) are generalizations of certain earlier results
(1, 2.1, 2.2, 2.3).





Proof of 2.1. Consider the known identity (1, 2.1), namely, that 


(2.4) A,(c):Po(d — b; x) .0,( + ¢, 6; xq ) 


equals the same expression with c and d interchanged. 







Comparing the coefficients of $x^n$ on both sides we get the transformation


a+c,b, —n; ao) 


(2.5) (c;m)(d — 6; nm) .0,(¢ l+b-—d-n 


a +d,b, —n;q'* *) 
dli+b—-c-—n , 
Replacing n, a, b, c, and d respectively by 2m + n, a — m, b — m, c — m, 
d — m in (2.5) we find that 

(2.6) (c:m+n)(d — b;:m+n)(a +c — 2m;m) 


a+c—m,b, —m—n;q'* a 
xadets~ =o ~o- md 


= (d;n)(c — 6b; n) so,( 


equals the same expression with c and d interchanged.* Hence on comparing 
the coefficients of x" and using the relation (2.6) we find that 


(c;m)(d — b;m)(a + c — 2m;m) A,(c + m) 


ities Btenahes ) 
x w(4 +m Ps + ws, c 


equals the same expression with c and d interchanged, which proves (2.1). 
Putting m = 0 (2.1) gives (2.4). 

Proceeding exactly in the above manner we can prove the identities (2.2) 
and (2.3). Putting m = 0 in (2.2) and (2.3) we get back to the known identities 
due to Agarwal (1, 2.2, and 2.3). 


3. Certain new identities and their generalizations. In this section 
we prove three new identities and later deduce their bilateral generalizations. 
The identities are 


- —— a,b,c; x ) 
(3.1) A(t +a — €) sFole OE scans 


n , oa ae aa ott ett h han 
A(e) :Fo(1 + a4 2c; x) (3, aot tne ye 


and 


a, 4a + 1,6, c;x ) 


ta, l+a-—b,1l+a-—< 


(3.2) A(l1+a—c) ,Fole — c;x) Pf 


. 7. Be 2b, a, 
= *o — 26; = ae ’ 
A(e) Fol +a c ones —* 


*The limiting case as gq— 1 of (2.6) can be obtained directly by putting ¢c = m+n+ 1 
and replacing 2 — e, 2 — f, b, d, and a@ respectively by c,d,1 — 5, 1 m, and 1 +a — 2m 
in a known result due to M. Jackson (6, p. 34). 













provided 1 + a — c — e = 20 in (3.1) and (3.2), and 


3 -% . (a, 4a + 1,b,c;x ) 
(3.3) A(1 + a — c) iFole oar :.-. 


—2b-—1,4¢4+4-—6,-—b—1,c;x 


= A(e) ,Fo(1 + a — 2c; 2) P(E 1 “ig pa 


provided a — c — e = 2b. 


Proof of (3.1). It is easy to see that for suitably restricted parameters we
have (4)

$$\frac{\Gamma(\theta + c)\,\Gamma(d - c)}{\Gamma(\theta + d)}\, f(x) = \int_0^1 u^{c-1} (1 - u)^{d-c-1} f(xu)\, du.$$
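
The operator identity above can be tested on a single power of x. The Python sketch below is a numerical illustration under stated assumptions: it reads the formula as the classical Euler beta integral, with the operator x d/dx acting on x^r as multiplication by r, and it uses a hand-written composite Simpson rule; the function names and the sample values c = 2, d = 5, r = 3 are hypothetical choices.

```python
from math import gamma

def simpson(func, a, b, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

def operator_side(c, d, r):
    # Gamma(theta + c) Gamma(d - c) / Gamma(theta + d) applied to x^r,
    # where theta = x d/dx multiplies x^r by r (coefficient of x^r only).
    return gamma(r + c) * gamma(d - c) / gamma(r + d)

def integral_side(c, d, r):
    # Integral of u^(c-1) (1-u)^(d-c-1) (xu)^r over [0, 1], coefficient of x^r only.
    return simpson(lambda u: u ** (c - 1 + r) * (1 - u) ** (d - c - 1), 0.0, 1.0)

left = operator_side(2, 5, 3)    # Gamma(5) Gamma(3) / Gamma(8) = 1/105
right = integral_side(2, 5, 3)
```

Integer parameters were chosen so that the integrand is a polynomial and the quadrature is accurate; for general parameters the endpoint singularities would call for a different rule.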





Let us replace $c$ and $d$ respectively by $1 + a - c$ and $e$ and take


: \ pla,5,¢;% 
f(x) = sFole — 0:2) aF(‘ +a-b,1+a- 2 


Then the right-hand side becomes 


1 
a—c “ e+c—a—2 aii ce pp (4, b,c; xu ) 
fu (1 — u) (1 — ux) (4 Lawhtteus du. 


) 


Expanding the ;/, series and interchanging the order of integration and 


summation (which is easily justifiable), we have 





— (a),(b),(c), SS nf a+Tr—c = e+c—a—2 = .\c-e 
u Mite-bii + a=," _™ e! u) (1 ux)” “du 
-y —__@ (6) x rl +a—c+r)Ple+c—a—1) 
—_" ri(l +a — b),(1 +a-—c),’ (ie +r) 


ales +a aan | 


e+r 


Using Euler’s identity (Tract 1, 1.2, 2.) we get 


ril+a-—ce)l he +c —ea— }) 





(3.4) — Te) —— ,F)(1 + a — 2c; x) 
(a) (b)(C), es eters —-a- is) 
ee a hea . 
Now, we have the transformation 
e-hes*-5 —b,c; ’ 
—-bl1+a-—),e 
of r(b)-(c), -pliteete—e—tis 
Ye “Ne+tr 


) 




provided $1 + a - c - e = 2b$, which can easily be obtained by collecting the
coefficients of $x^n$ and using the known summation theorem (2, §§ 4.5, 1.2).

Hence using this transformation in (3.4) we have the required identity (3.1).
To prove (3.2) and (3.3) we proceed exactly as above with

$$f(x) = {}_1F_0(e - c;\, x)\; {}_4F_3\!\left[\begin{matrix} a,\ \tfrac12 a + 1,\ b,\ c;\ x \\ \tfrac12 a,\ 1 + a - b,\ 1 + a - c \end{matrix}\right],$$

and use the transformations (2, §§ 4.5, 1.3, 1.4) giving the sum of a nearly-poised ${}_4F_3$.

We can also obtain the basic analogue of the identity (3.3) in the form


q° qe" _ p,hetl qg q°; xq" 
(3.5) A,(1 + a — c) 1o(g"“; x) (fy ~¢'*, git? git*-* ) 
a—2d—-1 jat+i—d _ _jat+i-—d _—d-1 ec. 1l+a—2¢ 
= A, (e):®0(q'**”*; x) rs ae —¢ A ’ q ’ xq ) 


je4—0 qit*> g 
provided a — c — e = 2b. 
To prove (3.5) we use the basic integral 
r,(6 + c)l,(d —c 
r,(6 + d) 
used in an earlier paper as well (1), where (1 — g* x)~* means the basic bino- 
mial expansion 





1 
(3.6) ) &(x) = fu — uq)***@(xu) d(qu) 
0 





(1 — g*) (1 — g*)(1—g**’) , 
14+ —L Se ee 
*Ua-e8°" O-pa—-e5" * 


or ;Po (a; x). 
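
The basic binomial expansion mentioned here can be compared numerically with its product form. The Python sketch below is a hedged illustration, not the author's notation: it assumes the standard reading in which the coefficient of x^n is (1 - q^a)(1 - q^(a+1))...(1 - q^(a+n-1)) divided by (1 - q)(1 - q^2)...(1 - q^n), and it uses the classical q-binomial theorem, which for a positive integer a gives the finite product 1/((1 - x)(1 - xq)...(1 - xq^(a-1))). The helper names and the sample values are assumptions.

```python
def q_factorial(a, n, q):
    """(1 - q^a)(1 - q^(a+1)) ... (1 - q^(a+n-1)); empty product for n = 0."""
    prod = 1.0
    for k in range(n):
        prod *= 1.0 - q ** (a + k)
    return prod

def basic_binomial_series(a, x, q, n_terms=200):
    """Partial sum of the basic binomial expansion with numerator parameter a."""
    return sum(q_factorial(a, n, q) / q_factorial(1, n, q) * x ** n
               for n in range(n_terms))

def basic_binomial_product(a, x, q):
    """Product form, valid as written only for a positive integer a."""
    prod = 1.0
    for k in range(a):
        prod /= 1.0 - x * q ** k
    return prod

series_value = basic_binomial_series(3, 0.3, 0.5)
product_value = basic_binomial_product(3, 0.3, 0.5)
```

As q tends to 1 the product tends to (1 - x)^(-a), which is the sense in which the series is a basic analogue of the binomial expansion.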
Replace c and d respectively by 1 + a — ¢ and e in (3.6) and take 
a _jat+i ja+l Db ee a—2b—2c¢ 
- Gq; »~Q 159459 ;5%q 
(x) = ,Bo(q* *; x) of j je l+a—d 1+a—c ). 
o.-¢ a + 
Proceeding as for (3.1) we obtain, on using a known summation theorem due to 
Bailey (3, § 3 (3)), the required identity (3.5). 
It may be noted that as a consequence of these identities we get certain 
interesting relations between two terminating nearly-poised series. From (3.1), 
(3.2), and (3.3) respectively we get 


(3.7 p(o be 9 ) 
Py on l+a-—b1l+a-—cl+c—e-—n 


(1 + a — 2c)n(e)n alert + 4a — b, —b,c, —n; ) 
4 


ta — b6,1+a—56,e,2c—a-—n 





(lt+a—c)(e—c),’ 
provided 1 + a — c — e = 2b, 
, 4a + 1,6,c, —n; ) 
a,it+a-blt+a-—-clt+ec-—e-—n 


— _ (L+ a —2)n(en “(2 — 2b, —b, c, —n; ) 
~ (l+a—c),(e—c),* \+t+a-—5b,¢,2c—-a—n 


(3.8) (2 
2 



















provided 1 + a — c — e = 2b, and 


a, 4a + 1, b,c, —n; ) 
ta,l+a-—-bl+a-—-cl+c-—e-—n 


_ _ (1+ a — 2)a(e), r( 
~ (l4+a—c)s(e— cn” * 
provided a — c — e = 2b. 
The basic analogue of (3.9) may be written as (from 3.5) 


q qe —q***" q’ q° q"34 
(3.10) of ja ha hes Mans it.) 

Qq,-qd .q »q q 
_ | axe n)(q°; n) q* ~ gist > —gtt q b- . q‘, q*; q 
ts s n)(q°°; n) 6*5 * ated —-get oo. q‘, q** a—n 
provided a — c — e = 20. 

Next we deduce the bilateral generalizations of the identities (3.1), (3.2), 
and (3.3). In the known transformation due to Shukla (8, 2.2) let us take 
M = 3,N = 1,q = } = land q = 0. This gives us a relation between three 
non-terminating nearly-poised ,H, series and a terminating 4F; series viz.; 
(3.11) (l-— £E)(1 — F) 

” en ac a paplaaea ii ) 

2+a-—-b-—-D,2+a-—c-—-D,2+c-—e-—-D,D-a,D-b,D-c 
(D+e-—c—1)n 


(3.9) F 





ee RF — ee ls 
l+a-—b,¢e,2c-—-a-—n 














re (D). 
Polhill “> Blinn ahi ) 
2-—-D,2+a-b-—-D,2+a-—-c-—-D,2+c-—e-D-n 
+ idem (D; E, F) 
D, E, F,2 — D,2 — E,2 — F; \ 
Ms oc ceueeadl 
€e—C), ~{a,b,c, —n; 
elt teghe hens 48.) 


Transform the ,F; series on the right by (3.7) and then replace n, a, b, c, and 
e respectively by 2m +n, a — 2m, b — m, c — m, and e — m. After some 
simplification we find that (3.11) may be written as 


(3.12) (1— £)(1 — F) 
l+E-D,1+F-D,D-—E,D-—F; 
n2+a-—-b-—-D-—m,2+a-—-c—-D-—-m,2+c—e-D, 
D—a+2m,D-—b+m,D-—c+m 


(D + ¢ — ¢ — 1)2m (2 +e~-c-1+ ail 


x (D)» D + 2m 




fie ta 8 ~ tet 6h B= m8 +00 me 
\2— D.2+a-56—-D—m,.24+e-c-D-—--m 


+ idem (D; E, F) 
= D,E, F,2 —D,2 —E,2 — F; 
7 l+a-—-b—m,1+a-—-c—m,1+c—e,1—a+ 2m, 
l1—b+m,1—c+m 


Say Ser + gape he yaad aataalits 
™\l, ja —b,b-—a,c-a,l+a-—c 


~—_—__—__Ale + m) 
A(l+a—c+m) 

tof tag, 9h ey ada 

+m,}a—b+m,1+a—b,e 


1+a—2c+m.-. ) 
{ite on de 


x AG 


provided $1 + a - c - e = 2b - 2m$. (3.12) reduces to (3.1) when $D = 1$ and
$m = 0$.

In exactly the same manner bilateral generalizations of the identities (3.2) 
and (3.3) may be written in the form 


(3.13) (1-E)(— F)\(i—-G) 


et 8 = 0442 -Rt+0- AP RP & 
I1+4a —-D—-m2+a-b-—-D—-m2+a-—-c—-D—m2+c—e-D, 


D—G; | 
D—a+2m,D—-—}ta-—-1+m,D-—b+m,D-—-c+m 


D+e—c—1)m ‘rip laliaiiaea) 


(D)»  \p+2m 


Ee Ba oS 8 m4 Be 
’"\2 —D,1+ 4a — D—m,2+a-—b—D—m,2+a-—c—D—m 


+ idem (D; E, F, G) 


_ SD, Ez, F,G,2 — D,2 — E,2 — F,2 —G; 

7 l+a-—-b—m1l1+a-—-c—m,1+c—e, 4a —m,1—a-+ 2m, 
—ta+m,1—b+m,1—c+m 

pase hae h at eae A(e + m) 


ldb—a,c-—-a,l+a-c AQ + a —c+m) 


x 


> tthadliaas a Bie +0 + tens) 
alt HN lta — be 













provided 1 + a — c — e = 2b — 2m, 
(3.14) 


=f 


D, E, F,G,2 — D,2 — E,2 — F,2 —G; 
l+a-—-b—m,1l1+a-—-c—m,1+c —e, 4a — m,1 — a+ 2m, 
—ta+m,1—b+m,1—c+m 


ih (oi+s-%0—#- 1,-b—-1+m,1-eb+4-8) 
™1,b—a,c-—a,lt+a-c,4a—}4-— 65 


os Ale + m) — J 
Ail+a-—c+m) 


l+a-— maton 
x mae +m 


x ee eet -tm.-b~14On4, ) 
e"°\1 + m, 4a — 4 —b +m,1+a—d,e - 


provided $a - c - e = 2b - 2m$. Putting $D = 1$ and $m = 0$ in (3.13) and
(3.14) we get back to the identities (3.2) and (3.3).


4. Next we show how the bilateral generalizations due to Shukla (8, Theorems
I, II) of Orr's identities (Tract, § 10.1 (2 and 3)) may be deduced from each
other by the use of the following identity:


(4.1) 
(1 — a)m(1 — b)m(1 — a’)m(1 — 0 n{% x) att ( 2's x) 
O)m ss al )mzit 1+m,c’’/® "\1 + m,c’* 
— (c _ @)m(c a b)m(c’ = a’) m(c’ — b’) mn 
loyalties [i= 4a -¥ +o, ) 
CAG + m,c i ) Hs 1+m,c’ = 
with $a + a' + b + b' = c + c' + 2m$.
The identity (4.1) may be obtained from the known identity due to Chaundy
(4, 25).
The identities due to Shukla can also be written in the form
(1—E)r 1l+E-D,D-E; 
(4.2) oy + 2c — D — 2m, 8/2+-0+56—-c—-D—-m, 
D — 2b + 2m, D — 2a + 2m 
(e + D-a —b+4+ m)om (< +D-a-b-—}3+ anal 
x (D) om noe D+2m 


Pea 2 tnt £2 ~ te) : ai 
Hi? ~BRi4&~D—~G& + idem (D;E 





[ae ~ 9:8 | 
2c — 2m, 4 +a+6—c— m,1 — 2a + 2m, 1 — 2b + 2m 


r $6 eq berber! stable 
Lllgte4st+el—c4s-—c 


At ite) pfoe , jake 0+ 8+ e2-b4h +0, :) 





A(c + 4 + m) 1+m,c+1 
and 
1+E-D,D-E; 
(4.3) Oa Dam Stat e— Dm 
1+ D — 2a + 2m, D — 2b + 2m 
(c+D—a—b—}+m)m Segal 
x (D) om ui; + 2m w 
2a — 2m — D,2b6+1-— D — 2m ‘ 
ag ath 2) + idem (D; BE) 
of Ba- KS ~ hi ) 
~ Zc — 2m — 1,4 +a+6—c — m,2 — 2a + 2m, 1 — 2b + 2m 


pF heal peiiaitians A(c + m) fo? x) 


™\1,1,3/2-—c,c-—}34,l—e A(c -— +m)’ oH 1+ m,c’™ 


gie— et 4 +m, Wea trs ee, x) 
eA + m, ¢ 


Now let $q \to 1$ in (2.3) and then use the identity (4.1); to transform the right-hand
side put $a + a' = b + b' = c + \tfrac12 + m$, and $c' = c + 1$. This gives us
the transformation








alae + 1} +e— 51-0) Set ED 
xan, ao, :) 
= Tab +e a,b +64+6,4— a,c — 0) tA Em) 
AE ee ote). 


Using re on the left of the above equation we find that it reduces to (4.3) 
with a + 4,¢ + 4 for a and c. Thus (4.3) is connected with (4.2). 


5. Certain general hypergeometric expansions. In this section we
obtain certain general expansions both for ordinary and bilateral hypergeometric
series with the help of transformations deduced in previous sections.
Before proceeding to the actual deduction of the general expansions we
prove a lemma which is a bilateral generalization of a known transformation
due to Chaundy (4, 42).
Lemma. If 


Gy, ccc F " ; oe i 
¥(‘ +m,c,.. ist) and “(4 +m,c’,.. ia) 


are two basic bilateral hypergeometric functions (of any order) and $h$, $k$ any two
suitable constants, then


(5.1) 


(h + m;m)(k; m) ata , ig | of x | 
(k + m;m)(h;m) A,(k + 2m) iia? Fo Pr 


— (k — 1;1r)(Rk; 2r)(k — hj r)[((h + m; r)|"(a; ‘| ee FL 





ae «OCR i. 2r)[(k + m; 2r)\ "(cr we ae 
x “?” r(r—1)+(A+h) r TOe+té€+ f,... - x ‘) 
os k+m+2r,l1+m.ct+r,...’°4 
erent +s... .x) 
k+m+2r,1+m,c'+r,...’°J7° 


Proof. It is easily seen that
A,(h) (<: smh : - (t«. see. :) 
Ae(k) © a a, = k,c ae 
and 


(4m)r (—6 — m;r)A,(h + m) (* ea xq) 
q (6+ m+ k;r)A,(k + m) itae... 


_ (a;r)...(h+m;r) _ 
~ (cr)... (e+ m; 27) 4 


eeeetests..: .«@) 
kt+m+2r.l+mc+r,...'! ‘ 


Using the above transformation on the right-hand side of (5.1), it becomes 

















T,6+h+m)0(o+h+m) | Ee +=) f 
T.(0+k + m)I,(o +k +m) LIT,(h + m) 


x an 9) — oni — . F 
x > 7. 1; r)(k; 2r)(k — h;r)(—0 — m;r)(—@ — m; ) (046-4+2m+h) 


cao (13 7) (Rk — 1; 2r)(h; r\@+k+m; r)(o+k+m:r) 


Pp ae "te 
a(t on ‘ aad) o(2 ee ae 







where 6 = x 0/dx and @¢ = x 0/dx operate on the first and second series 
alone respectively and hence g*** = g*. Summing up the above well-poised 
s®; series we get the required result (5.1). If we take the limit ¢ — 1 we get 


the corresponding transformation for ordinary bilateral hypergeometric series, 
viz. : 


: (h + m)m(R)m A(h + 2m) is ces Se 
(5.3) —— , E se iz , :x 
(R + m)m(h)m A(k + 2m) 1+m,c,... Lo. 9 


(k — 1),(R)2-(k — h) ((h + m), é (a), --- (a"),.- - 2s 














A ri(k — 1)o,((k + m)o,) (h)-(c),... (e),... 


pire tents... e)a(btetye tr... x) 
l+m,k+m+2r,c+r,...’ l+mk+m4+2r,c+r,... °°) 


Applications. Applying the transformation (5.1) to the identity (2.1) we 
get 


= d—b+m. , is - a vet) 
(5.4) wi(t7> ;x]} oWe Aims ;x@ 


(c — b; m)(a + d — 2m; m)(d — m; m) 
(d — b; m)(a +c — 2m; m)(c — m:m) 





x ) 2 (c—m—1;r)(c—m; 2r)(c—d; r)(d; ir) (c—b+m; r)(d+a—m;r) 


wt (i: «s Saemgr gonm 2r)(c: 2r)(c; 2r)(d — m; r) 
2r_r(r )+(d—a—b+m)r 1 — ° 


- deeeesavinn b+r. sa 
' 1+m,d+r,c+ 2r q 3 


Similarly, applying the transformation (5.1) to (2.2) and (5.3) to (4.2) and 
(4.3) one can obtain three other expansions of similar type. 
Next, applying the transformation (5.3) for m = 0 (4, 42) to the right-hand 


side of the identities (3.1), (3.2), and (3.3) respectively, we get the following 
three expansions 


er a, b,c ’ 
(0.9) MAIS abe) 


at (2 r(1 + a — 2c),(a — 26),($a — b+ 1),(— b),(c), 
— sre (1 + a — c)2,(4a — 5),(1 +a — B), 


pets f ) 
~ l+a—c+2’" 


as ga —b+1+7,-b+7r,c4+7r. r) 
. la—b+r,1+a—b+r7,1+a—c+2r’ 








x 


xX 










with the condition 1 + a — c — e = 2b, 








“ a, ja + 1, b,¢ 

- Meteo) | 
->5 (1 + a — 2c),(a — 2b) )e(—b) (Cc), a 
— ae Tae it+e-h," 


xr (otee te a) eho Str. ~dtee+s ) 
A 46-64" l+a—b+7,1+a—c+2r'" 


with 1 +a —c —e = 2b, 








(5.7) 
-£ (2b — 1),(1 +a — 2c),(a — 2b — 1),($a + 3 — B),(— b—1),(c), 2% 
—t ri(a—c+r)(1 +a —c)2,(3a — } — b),(1 +a — 5), : 
‘ c+r,1+26+,1. :) 
naetet tes 7. 


r{2- 2b — Ltn det tbr bo ltnete,) | 
“Nis — 3 -54+7,14+6—-¢c4+%,146-547r ** 


provided a — c — e = 2b. 
The basic analogue of (5.7) may be written as below (by using (5.1) for 
m = 0 in (3.5)) 


—1 1+ 2¢ 2p—1. ha+ b 
rq sr) (¢ :r)(qi** 
= r a—c+r_ i+a—¢. 5. ja—j-—5__ 


=o (g;7r)(q ‘¢ Y(q - 2r)(q ;7) 


+7) 


1 ¢c 
>r)(q ts 2r_12+(e+a—2ce)r 
2 © 


— a 
, e c 
XK 2Pil teeter = XG 
q 
tn 2 
g - 
x 10, 
with a — c — e = 2b. 
Similar expansions could also be obtained from results due to Shukla (8, 


vii) and Henrici (5, a ~ b, a ~ c). 


I am grateful to Dr. R. P. Agarwal for his kind help and guidance during 
the preparation of this paper. 








REFERENCES


1. N. Agarwal, J. Lond. Math. Soc., 34 (1959), 37-46.
2. W. N. Bailey, Generalized hypergeometric series (Cambridge Tract, 1935).
3. W. N. Bailey, J. Lond. Math. Soc., 22 (1947), 237-40.
4. T. W. Chaundy, Proc. Lond. Math. Soc. (2), 50 (1949), 56-74.
5. P. Henrici, Pacific J. Math., 5 (1955), 923-31.
6. M. Jackson, J. Lond. Math. Soc., 27 (1952), 116-23.
7. F. H. Jackson, Amer. J. Math., 32 (1910), 305-14.
8. H. S. Shukla, Quart. J. Math. (Oxford), 10 (1959), 48-59.


Lucknow University 













ON CONNECTIONS BETWEEN GROWTH AND 
DISTRIBUTION OF ZEROS OF INTEGRAL 
FUNCTIONS 


Q. I. RAHMAN 


1. The following theorem was proved by Paley and Wiener (4, p. 70; 1, 
p. 136). 


THEOREM 1. If $f(z)$ is a canonical product of order 1 with real zeros, and
$f(0) = 1$, the conditions


(1) lim f x * log|f(x)\dx = — x’ A, 
and 
(2) lim r'n(r) = 2A, 


are equivalent. n(r) denotes the number of zeros of absolute value not exceeding r. 


Instead of assuming the zeros to be all real Pfluger assumed that the zeros 
are close to the real axis and proved the following theorem (5 or 1, p. 143). 


THEOREM 2. Let

$$f(z) = \prod_{n=1}^{\infty} \left( 1 - \frac{z}{z_n} \right)$$

be an entire function of exponential type, with $f(0) = 1$. Then the conditions


(3) lim r'n(r) = D, > r,*|\sin @,| = rC < 
Tc n=1 

and 

(4) lim f x log|f(x)|dx = —rI¥+ @ 


are equivalent, and D = 2C + 21. 

For a general order $\rho$ ($0 < \rho < 1$), the following theorem was proved by
Boas (2).

THEOREM 3. If $f(z)$ is of order less than 1, all its zeros are real and negative and
$f(0) = 1$, the conditions

$$\lim_{r \to \infty} r^{-\rho} n(r) = A, \tag{5}$$










Received July 2, 1958. Research supported by the National Science Foundation. 


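and

$$\int_0^r x^{-1-\sigma} \{ \log|f(-x)| - \pi \cot \pi\sigma \cdot n(x) \}\, dx \sim \pi A (\rho - \sigma)^{-1} (\cot \pi\rho - \cot \pi\sigma)\, r^{\rho - \sigma} \tag{6}$$

(for any $\sigma$, $0 < \sigma < 1$) are equivalent. When $\sigma = \rho$, (6) is to be interpreted as

$$\int_0^{\infty} x^{-1-\rho} \{ \log|f(-x)| - \pi \cot \pi\rho \cdot n(x) \}\, dx = -\pi^2 A \operatorname{cosec}^2 \pi\rho. \tag{6'}$$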
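
A concrete function satisfying the hypotheses of Theorem 3 helps to fix the meaning of condition (5). The Python sketch below is an illustration under assumptions that are not in the paper: it takes the zeros to be -1, -4, -9, ..., so that f(z) is the product of the factors (1 + z/n^2), of order 1/2 with f(0) = 1, and it uses the classical product formula for the sine, which gives f(-x) = sin(pi sqrt(x)) / (pi sqrt(x)). The function names are hypothetical.

```python
from math import isqrt, pi, sin, sqrt

def n_count(r):
    """n(r): number of zeros -n^2 (n = 1, 2, ...) of absolute value at most r."""
    return isqrt(int(r))

def partial_product(x, terms):
    """Partial product of f(-x), the product of (1 - x / n^2) for n = 1..terms."""
    prod = 1.0
    for n in range(1, terms + 1):
        prod *= 1.0 - x / (n * n)
    return prod

def closed_form(x):
    """sin(pi sqrt(x)) / (pi sqrt(x)), the value of f(-x) for x > 0."""
    w = pi * sqrt(x)
    return sin(w) / w

x = 2.5
approx = partial_product(x, 100_000)
exact = closed_form(x)

# Condition (5) with rho = 1/2: r^(-1/2) n(r) tends to A = 1 for this example.
density = n_count(10 ** 6) / sqrt(10 ** 6)
```

The quantity log|f(-x)| entering (6) can then be evaluated from the closed form rather than from the slowly converging product.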


In Theorem 4 of this note we extend the result of Boas to the case where 
the zeros do not necessarily lie on the negative real axis but are close to certain 
lines. 


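THEOREM 4. Let

$$f(z) = \prod_{n=1}^{\infty} \left( 1 - \frac{z}{z_n} \right) \tag{7}$$

be an entire function of order less than 1. If

$$\sum_{n=1}^{\infty} r_n^{-\sigma} \{ 2 \sin (\theta_n + \pi)\sigma + \sin 2\pi\sigma \} = C \neq \pm\infty \qquad (0 < \sigma < 1) \tag{8}$$

then the conditions (5) and

$$\int_0^r x^{-1-\sigma} \{ \log|f(-x)| - \pi \cot \pi\sigma \cdot n(x) \}\, dx \sim \pi A (\rho - \sigma)^{-1} (\cot \pi\rho - \cot \pi\sigma)\, r^{\rho - \sigma} + \frac{\pi C}{\sigma (1 - \cos 2\pi\sigma)} \tag{9}$$

are equivalent. When $\rho = \sigma$, (9) is to be interpreted as

$$\int_0^{\infty} x^{-1-\rho} \{ \log|f(-x)| - \pi \cot \pi\rho \cdot n(x) \}\, dx = -\pi^2 A \operatorname{cosec}^2 \pi\rho + \frac{\pi C}{\rho (1 - \cos 2\pi\rho)}. \tag{9'}$$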
On putting $\rho = \sigma = \tfrac12$ in the above theorem and interpreting the result
in terms of functions of order 1, we get Pfluger's theorem. Since (8) is satisfied
a priori for every $\sigma > \rho$ we have the following

COROLLARY. If

$$f(z) = \prod_{n=1}^{\infty} \left( 1 - \frac{z}{z_n} \right)$$

is an entire function of order less than 1, and $\rho < \sigma < 1$ then the conditions (5)
and (9) are equivalent.














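Proof of Theorem 4. We prove the theorem by comparing $f(z)$ with another
function which has real negative zeros. Let

$$F(z) = \prod_{n=1}^{\infty} \left( 1 + \frac{z}{r_n} \right)$$

where $r_n = |z_n|$. We have

$$\log \left| \frac{f(-x)}{F(-x)} \right| = \sum_{n=1}^{\infty} \log \left| \frac{z_n + x}{r_n - x} \right|.$$

The series on the right has positive terms and so

$$\int_0^r x^{-1-\sigma} \log|f(-x)|\, dx = \int_0^r x^{-1-\sigma} \log|F(-x)|\, dx + \sum_{n=1}^{\infty} \int_0^r x^{-1-\sigma} \log \left| \frac{z_n + x}{r_n - x} \right| dx. \tag{10}$$

The number of zeros of $F(z)$ in $|z| \le x$ is $n(x, F) = n(x, f) = n(x)$. Subtracting

$$\int_0^r x^{-1-\sigma}\, \pi \cot \pi\sigma \cdot n(x)\, dx$$

from both sides of (10), we get

$$\int_0^r x^{-1-\sigma} \{ \log|f(-x)| - \pi \cot \pi\sigma \cdot n(x) \}\, dx = \int_0^r x^{-1-\sigma} \{ \log|F(-x)| - \pi \cot \pi\sigma \cdot n(x) \}\, dx + \sum_{n=1}^{\infty} \int_0^r x^{-1-\sigma} \log \left| \frac{z_n + x}{r_n - x} \right| dx. \tag{11}$$

We now show that, as $r \to \infty$, the limit (finite or infinite) of the sum on the
right is

$$\frac{\pi}{\sigma (1 - \cos 2\pi\sigma)} \sum_{n=1}^{\infty} r_n^{-\sigma} \{ 2 \sin (\theta_n + \pi)\sigma + \sin 2\pi\sigma \}. \tag{12}$$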


To do this put

$$\phi(z) = \log \frac{z_n + z}{r_n - z}$$

where it is assumed that $z_n$ is not real and negative. If the value of $z^{\sigma}$ is the
principal value and we integrate $z^{-1-\sigma} \phi(z)$ around the contour consisting of
the circle $|z| = r$ with a cut from $r$ to 0 and back again having indentations to
avoid the $r_n$'s and the origin, then $\phi(z)$ increases by $2\pi i$ as we traverse the contour
starting at $z = r$. On integration by parts


2 
feo) de = - 3 ory 1 fe *¢'(z) 
9 
ac ct +— ) ae 
z 2 %—s8 


— 1 o = @ —anti¢e 
aes ~~ — = girs*(1 +6") 
o 
















2x1 2n1 et 4+2))- 1 —2 
= a ea — {r, ~ ag a — > tie “(1 +e a 
2at , 2a 0 he 
x —¢ —1(On+") -2 
= —— + ane r.°¢ +2)e ~~ +e we) 
or og a 
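As $r \to \infty$, the integral along $|z| = r$ tends to zero, so we have (combining
the integrals along the two sides of the cut and equating real parts in the
limiting form of the equation)

$$(1 - \cos 2\pi\sigma) \int_0^{\infty} x^{-1-\sigma} \log \left| \frac{z_n + x}{r_n - x} \right| dx = \frac{\pi}{\sigma}\, r_n^{-\sigma} \{ 2 \sin (\theta_n + \pi)\sigma + \sin 2\pi\sigma \}$$

or

$$\int_0^{\infty} x^{-1-\sigma} \log \left| \frac{z_n + x}{r_n - x} \right| dx = \frac{\pi}{\sigma (1 - \cos 2\pi\sigma)}\, r_n^{-\sigma} \{ 2 \sin (\theta_n + \pi)\sigma + \sin 2\pi\sigma \}.$$

$F(z)$ has only real negative zeros and $n(r, F) = n(r, f) \sim A r^{\rho}$. Therefore
(Theorem 3 above) the integral on the right-hand side of (11) is

$$\sim \pi A (\rho - \sigma)^{-1} (\cot \pi\rho - \cot \pi\sigma)\, r^{\rho - \sigma}$$

which is to be interpreted for $\sigma = \rho$ as $-\pi^2 A \operatorname{cosec}^2 \pi\rho$. Hence if we suppose
that (8) and (5) hold, then (9) will hold. The fact that (8) and (9) imply (5)
is immediate.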


2. The following theorem of the same general nature has been proved by
Clunie (3).

THEOREM 5. Let $f(z)$ be an integral function of genus zero and lower order $\lambda$,
$0 < \lambda < 1$, which has all but a finite number of its zeros, $z_n$, in the upper half
plane. If $\operatorname{Re} z_n = o(|z_n|)$ as $n \to \infty$, then the conditions

$$\lim_{x \to \infty} x^{-\rho} \log|f(x)| = \tfrac12 \pi A \operatorname{cosec} \tfrac12 \pi\rho \tag{13}$$

and

$$\lim_{r \to \infty} r^{-\rho} n(r) = A \tag{5}$$

are equivalent.

Let $n_+(r)$ and $n_-(r)$ count, respectively, the zeros in $\operatorname{Im} z > 0$ and $\operatorname{Im} z < 0$.
Following the method of Clunie we prove the following extension of Theorem 5.

THEOREM 6. Let $f(z)$ be an integral function of genus zero and lower order $\lambda$,
$0 < \lambda < 1$. If at least one of the two numbers $n_+(r)$ and $n_-(r)$ is $O(r^{\rho})$ as $r \to \infty$
and $\operatorname{Re} z_n = o(|z_n|)$ as $n \to \infty$, then (13) implies (5).


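Proof. Let us suppose for definiteness that $n_-(r) = O(r^{\rho})$. Without loss of
generality we may assume that $f(0) = 1$. Let, consequently,

$$f(z) = \prod \left( 1 - \frac{z}{z_n} \right) = \prod \left( 1 - \frac{z}{b_n} \right) \prod \left( 1 - \frac{z}{c_n} \right) = P(z) \cdot Q(z),$$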













where $b_n$ and $c_n$ denote, respectively, the zeros lying in the upper half plane
and the lower half plane. If $\delta$ is fixed, $0 < \delta < \pi$, and $z = re^{i(\pi - \delta)}$ then (3,
p. 139) for $n \ge n_0(\delta)$





Hence 





as $r \to \infty$, and thus as $r \to \infty$,

$$\log|P(z)| \le \log|P(r)| + o(1).$$

Further, if we take $\delta$ to be sufficiently small, then, for $n \ge n_0(\delta)$, we will have

$$\left| \frac{c_n - z}{c_n - r} \right| < 2.$$


Hence 
]Q0s)| Fy len —2| onc 
teal cs pee. 





and thus as $r \to \infty$

$$\log|Q(z)| \le \log|Q(r)| + O(r^{\rho}).$$

Therefore on the positive real axis and on the radius $\arg z = \pi - \delta$ we find
that

$$\log|f(z)| = O(r^{\rho}).$$

By the Phragmén-Lindelöf principle it follows that $f(z)$ is of order $\rho$ and mean
type. The rest of the argument is the same as that of Clunie (3, pp. 139-40).


In conclusion I wish to thank Professor R. P. Boas, Jr. for his valuable 
guidance. I am also indebted to the referee for suggesting improvements. 


REFERENCES 


1. R. P. Boas, Jr., Entire functions (New York, 1954).
2. R. P. Boas, Jr., Integral functions with negative zeros, Can. J. Math., 5 (1953), 179-84.
3. J. Clunie, On a theorem of Noble, J. Lond. Math. Soc., 32 (1956), 138-44.
4. R. E. A. C. Paley and N. Wiener, Fourier transforms in the complex domain (New York, 1934).
5. A. Pfluger, Ueber gewisse ganze Funktionen vom Exponentialtypus, Comm. Math. Helvet., 16 (1944), 1-18.





Northwestern University 
and 
Muslim University, Aligarh 










A COSINE FUNCTIONAL EQUATION IN HILBERT 
SPACE 


SVETOZAR KUREPA 


Throughout this paper R denotes the set of all real numbers, m(K) the 
Lebesgue measure of K C R, H a Hilbert space, L(/7) the set of all linear 
continuous mappings of H into H, endowed with the usual structure of a Banach 
space. 

We consider the mapping F of the set R into L(#) such that 


(1) F(x + y) + F(x — y) = 2F(x) F(y) 


holds for all x, y ∈ R. In (2) we have solved this equation under the assumption
that H is of finite dimension. In this paper we prove that weak measurability
of F implies its weak continuity in the case of a separable Hilbert space. In
Theorem 2 we prove that every weakly continuous solution of (1) in the set
of normal transformations has the form F(x) = cos (xN), where the normal
transformation N does not depend on x.
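
As a numerical illustration of the form F(x) = cos (xN), the following minimal sketch checks equation (1) in the simplest finite-dimensional case. The 2 × 2 real matrix N, the helper names and the truncation of the cosine series are assumptions made for the illustration only; they do not come from the paper.

```python
# Sketch only: N is a hypothetical 2x2 normal matrix (N N^T = N^T N),
# and cos(M) is approximated by a truncated power series.

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mat_add(A, B):
    return [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]

def mat_scale(A, c):
    return [[c * A[i][j] for j in range(2)] for i in range(2)]

def mat_cos(M, terms=40):
    """cos(M) from the series sum_k (-1)^k M^(2k) / (2k)!."""
    identity = [[1.0, 0.0], [0.0, 1.0]]
    M2 = mat_mul(M, M)
    result, term = identity, identity
    for k in range(1, terms):
        # term_k = -term_{k-1} * M^2 / ((2k-1)(2k))
        term = mat_scale(mat_mul(term, M2), -1.0 / ((2 * k - 1) * (2 * k)))
        result = mat_add(result, term)
    return result

def F(N, x):
    """Candidate solution F(x) = cos(xN)."""
    return mat_cos(mat_scale(N, x))

def cosine_equation_residual(N, x, y):
    """Largest entry of F(x+y) + F(x-y) - 2 F(x) F(y) in absolute value."""
    lhs = mat_add(F(N, x + y), F(N, x - y))
    rhs = mat_scale(mat_mul(F(N, x), F(N, y)), 2.0)
    return max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))

N = [[1.0, -0.5], [0.5, 1.0]]  # identity plus a skew-symmetric part, hence normal
print(cosine_equation_residual(N, 0.7, 0.3))
```

The printed residual should be at the level of rounding error, since F(x) and F(y) are both functions of the same matrix N and therefore commute.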

We start with a preliminary lemma. 


LEMMA 1. Let K be a linear Lebesgue measurable set such that 0 < m(K) <
+∞. There exists a number a > 0 with the property that for every x ∈ (−a, a)
there are s_1(x), s_2(x), s_3(x) ∈ K such that s_1(x) = s_2(x) − x/2 = s_3(x) − x.


Proof. Let u(x) be the function defined on the set of all real numbers R by
the equation u(x) = m(K ∩ (K − x/2) ∩ (K − x)). If χ(t) denotes the
characteristic function of the set K then
|u(x) − u(0)| = |∫ [χ(t)χ(t + x/2)χ(t + x) − χ(t)χ(t + x)

+ χ(t)χ(t + x) − χ(t)] dt|
≤ ∫ |χ(t + x/2) − χ(t)| dt + ∫ |χ(t + x) − χ(t)| dt.


Since the right side tends to zero as x → 0 we find the function u(x)
continuous at x = 0. Since u(0) = m(K) ≠ 0, there exists a constant a > 0 such
that u(x) ≠ 0 for all x ∈ (−a, a). But u(x) ≠ 0 implies K ∩ (K − x/2) ∩
(K − x) ≠ ∅. Hence for each x ∈ (−a, a) there are s_1(x), s_2(x), s_3(x) ∈ K
such that s_1(x) = s_2(x) − x/2 = s_3(x) − x and hence Lemma 1 is proved.
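
The function u(x) of the proof can be computed explicitly when K is a finite union of disjoint intervals. The sketch below is an illustration under that assumption; the particular set K and the function names are hypothetical and serve only to show u(0) = m(K), the positivity of u(x) for small x, and the resulting points s_1, s_2, s_3.

```python
# Sketch only: K is assumed to be a finite union of disjoint open intervals,
# given as a list of (left, right) pairs.

def shift(intervals, s):
    """The set K - s = {k - s : k in K}."""
    return [(a - s, b - s) for a, b in intervals]

def intersect(A, B):
    """Intersection of two finite unions of disjoint intervals."""
    out = []
    for a1, b1 in A:
        for a2, b2 in B:
            lo, hi = max(a1, a2), min(b1, b2)
            if lo < hi:
                out.append((lo, hi))
    return out

def measure(intervals):
    return sum(b - a for a, b in intervals)

def common_set(K, x):
    """K ∩ (K - x/2) ∩ (K - x)."""
    return intersect(intersect(K, shift(K, x / 2)), shift(K, x))

def u(K, x):
    return measure(common_set(K, x))

def witness(K, x):
    """Points s1, s2, s3 in K with s1 = s2 - x/2 = s3 - x, if u(x) > 0."""
    common = common_set(K, x)
    if not common:
        return None
    a, b = common[0]
    t = (a + b) / 2
    return (t, t + x / 2, t + x)

def contains(K, s):
    return any(a < s < b for a, b in K)

K = [(0.0, 1.0), (2.0, 2.5)]   # hypothetical example, m(K) = 1.5
print(u(K, 0.0), u(K, 0.1), witness(K, 0.1))
```

For this K one finds u(0) = 1.5, while for x = 0.1 the common set is (0, 0.9) ∪ (2, 2.4), so u is still positive and a triple s_1, s_2, s_3 exists.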

THEOREM 1. Let F be a mapping of R into L(H) which satisfies (1) for every
x, y ∈ R.

Suppose that: (1) there is an interval I = [a, b] ⊂ R such that the restriction
of F to I is weakly measurable;

Received December 28, 1958. Presented under the same title to the International Congress 
of Mathematicians, Edinburgh, 1958. 




(2) if F(x)f = 0, almost everywhere, then f = 0; (3) H is a separable Hilbert 
space. 
Then F is weakly continuous on R. 


Proof. We divide the proof into three parts. 


1. The function F is measurable on R. (1) implies:


F(x − ½(b − a)) = 2F(x) F(½(b − a)) − F(x + ½(b − a)).


When x runs through the interval [a, ½(a + b)] then x + ½(b − a) runs over
the interval [½(a + b), b]. Since F(y) is measurable on each of these intervals
we find that F(y) is measurable on the interval [a − ½(b − a), a]. Thus, the
measurability of the function F on the interval I implies the measurability
of this function on the interval I′ = [a − ½(b − a), b]. The way by which I′
is obtained from I enables us to deduce that the function F is measurable
on the set (−∞, b]. For x = 0 (1) implies that F is an even function. Thus
the function F is measurable on the set of all real numbers.


2. The function F is locally bounded. The separability of H implies
immediately that x → ||F(x)|| is a measurable function, hence there is a
measurable set K ⊂ R of strictly positive measure such that L = sup ||F(x)||
< +∞ (x ∈ K). We assert that ||F(x)|| is bounded on every finite interval.
Since the function F is an even function we can, without loss of generality,
assume that K ⊂ [0, +∞). If we put x + y instead of x in (1) we get:
F(x) = 2F(x + y)F(y) − F(x + 2y). This implies:


(2) ||F(x)|| ≤ 2||F(x + y)|| · ||F(y)|| + ||F(x + 2y)||.
For x = y (1) implies: F(2x) = 2F²(x) − E and this gives:

(3) ||F(2x)|| ≤ 2||F(x)||² + 1.

From (2) and (3) we get:

(4) ||F(x)|| ≤ 2||F(x + y)|| · ||F(y)|| + 2||F(y + ½x)||² + 1.


According to Lemma 1 there exists a number a > 0 with the property that
for every x ∈ (0, a) a number y can be found such that y, y + ½x, y + x ∈ K.
If x ∈ (0, a) and if y is the corresponding element of K then (4) implies:
||F(x)|| ≤ 4L² + 1 for every x ∈ (0, a). Thus the function ||F(x)|| is bounded
on the interval (0, a). This and (3) imply that ||F(x)|| is bounded on the
interval (0, 2a). From this we infer that the function ||F(x)|| is bounded on
every finite interval of the type (0, b) (b > 0). Since F is an even function
we have that it is bounded on every finite interval.


3. The function F is weakly continuous. Since the function F(x) is 
measurable and locally bounded, the functional 





(5) ∫_a^b (F(x)f, g) dx


is a bounded linear functional of f on H for any a, b ∈ R and g ∈ H. There is,
therefore, a unique element g_ab ∈ H such that:


∫_a^b (F(x)f, g) dx = (f, g_ab)


for every f ∈ H. Let H′ denote the set of all g_ab. We assert that H′ is dense
in H. In fact, let h ∈ H, h ⊥ H′, that is, let


(6) ∫_a^b (F(x)h, g) dx = 0


for all g ∈ H and for all numbers a and b. For given, but arbitrary g, (6)
implies:
(7) (F(x)h, g) = 0


for x ∉ S_g, where m(S_g) = 0. Let A = {g_1, g_2, g_3, ...} be a countable set dense
in H and let


S = ∪_{n=1}^∞ S_{g_n}.


According to (7) we have
(8) (F(x)h, g_n) = 0


for all x ∉ S. Since A is dense in H (8) implies F(x)h = 0 for every x ∉ S,
that is, almost everywhere. The requirement (2) of Theorem 1 implies h = 0,
that is, the set H′ is dense in H.
If we put 2F(y)f instead of f in (5) and if we use (1) we find:


(9) 2(F(y)f, g_ab) = ∫_{a+y}^{b+y} (F(x)f, g) dx + ∫_{a−y}^{b−y} (F(x)f, g) dx.


If y_n tends to y_0, then (9) implies: (F(y_n)f, h) → (F(y_0)f, h) for every h ∈ H′.
Since the sequence F(y_n)f is bounded and since H′ is dense in H we find
(F(y_n)f, g) → (F(y_0)f, g)

for each pair f, g ∈ H, that is, F(y_n) tends weakly to F(y_0) whenever y_n
tends to y_0. This proves that F is weakly continuous. Q.e.d.

THEOREM 2. Let N(x) be a mapping of R into L(H) which satisfies (1) for
every x, y ∈ R.

Suppose that: (1) N(x) is a normal transformation for every x ∈ R; (2) if
N(x)f = 0, almost everywhere, then f = 0; (3) N(x) is weakly continuous.

Then a bounded self-adjoint transformation B and a self-adjoint transformation
A which commutes with B can be found in such a way that


N(x) = ½[exp (ixN) + exp (−ixN)] = cos (xN)
holds for all x where N = A + iB.











Proof. I. As in Theorem 1 we have


∫_a^b (N(x)f, g) dx = (f, g_ab).


We assert that the set H′ of all g_ab is dense in H. In fact if h is an element
of H which is orthogonal to H′, then (6) holds for all a, b ∈ R and g ∈ H.
The continuity of the function (N(x)f, g) together with (6) implies (7) for every
x ∈ R and for every g ∈ H. From here we get N(x)h = 0 for all x which
implies h = 0. Thus the set H′ is dense in H. Using (1) we obtain:


((N(x) − E)/x f, g_ab) = (1/(2x)) [ ∫_b^{b+x} (N(u)f, g) du + ∫_b^{b−x} (N(u)f, g) du
        − ∫_a^{a+x} (N(u)f, g) du − ∫_a^{a−x} (N(u)f, g) du ],


which implies:


lim_{x→0} ((N(x) − E)/x f, g_ab) = 0


for every g_ab ∈ H′ and for every f ∈ H. From here it follows that the sequence


((N*(x) − E)/x) h


converges weakly to zero for every h ∈ H′, when x → 0. There exists,
therefore, a number M(h) such that:
||[N(2^{-n}) − E] h|| ≤ 2^{-n} M(h).
This implies that the series


(10) Σ_{n=1}^∞ ||[N(2^{-n}) − E] h||²


is convergent for every h ∈ H′.

II. The fact that N(x) is an even function implies that N(x) and N(y)
commute with one another for every pair of real numbers x and y. Now
we consider the functional equation (1) only for x and y from the set

G = {r | r = 2^{-l} k; l, k = 0, ±1, ±2, ...}.


Since G is countable and since N(r) and N(r′) (r, r′ ∈ G) commute we find
(4, p. 67),

(11) N(r) = ∫_R f(ξ, r)E(Δ_ξ)


where E(Δ) is a real spectral measure and the function f(ξ, r) is E(Δ)-measurable
and finite everywhere for every r ∈ G. If we put (11) in (1) we get:


(12) f(ξ, r + r′) + f(ξ, r − r′) = 2f(ξ, r) f(ξ, r′)










for all r, r′ ∈ G and for almost all ξ (G is countable!). Using (11) we can
write (10) in the form:


lim_{N→∞} ∫_R Σ_{k=1}^N |f(ξ, 2^{-k}) − 1|² ||E(Δ_ξ)h||².


From the above it follows that the series


(13) Σ_{n=1}^∞ |f(ξ, 2^{-n}) − 1|²


is convergent almost everywhere with respect to the measure ||E(Δ)h||². Since
the set H′ is dense in H the series (13) is convergent almost everywhere
with respect to E(Δ). Thus


(14) f(ξ, 2^{-n}) → 1


almost everywhere with respect to E(Δ). It follows from (14) and (12) that


(15) f(ξ, r) = ½[exp (irφ(ξ)) + exp (−irφ(ξ))]


holds true almost everywhere in ξ and for all r ∈ G (see (2, Lemma 4)). Here
φ(ξ) is an E(Δ)-measurable and everywhere finite complex-valued function. Thus
the transformations


(16) N = ∫_R φ(ξ)E(Δ_ξ), A = ∫_R [Re φ(ξ)]E(Δ_ξ) and B = ∫_R [Im φ(ξ)]E(Δ_ξ)

are defined. Since
||N(r)|| = ess sup |f(ξ, r)| < +∞
for every r ∈ G, we find:
ess sup |Im φ(ξ)| < +∞,
that is, the transformation B is bounded. Then (16), (15), and (11) imply:
N(r) = ½[exp (irN) + exp (−irN)] = cos (rN)


for every r ∈ G. By the weak continuity and the fact that the set G is dense
in R we find: N(x) = cos (xN) for every x ∈ R.


Remark 1. If we consider a mapping r → N(r) of the set G in the set L(H)
such that:


(1) N(r) is a normal transformation;
(2) N(r + r′) + N(r′ − r) = 2N(r)N(r′) for all r, r′ ∈ G, and
(3) lim ||N(1/2^n) − E|| = 0


then N(r) = cos (rN), where the normal transformation N does not depend on
r. Indeed the representation (11) holds in this case too. Since ||N(r)|| = ess
sup |f(ξ, r)|, (14) also holds. This together with (11) leads to (12) and
consequently to (15), from which N(r) = cos (rN) follows.













REFERENCES 
1. R. Phillips and E. Hille, Functional analysis and semigroups, Amer. Math. Soc. Coll. Pub.
(1957).
2. S. Kurepa, A cosine functional equation in n-dimensional vector space, Glasnik mat. fiz. i
astr., 13 (1958), 169-189.
3. ——, On the (C)-property of functions, Glasnik mat. fiz. i astr., 13 (1958), 33-38.
4. B. Sz. Nagy, Spektraldarstellung linearer Transformationen des Hilbertschen Raumes (Berlin,
1942).


Department of Mathematics, Zagreb 













SHEETS OF REAL ANALYTIC VARIETIES 
ANDREW H. WALLACE 


Introduction. In a previous paper (4) the author worked out some results 
on the analytic connectivity properties of real algebraic varieties, that is to 
say, properties associated with the joining of points of the variety by analytic 
arcs lying on the variety. It is natural to ask whether these properties can be 
carried over to analytic varieties, since the proofs in the algebraic case depend 
mainly on local properties. But although this generalization can be carried 
out to a large extent, there are, nevertheless, difficulties in the analytic case, 
owing mainly to the fact (cf. 2, § 11) that a real analytic variety may not be 
definable by means of a set of global equations. Thus, although the general 
idea of the treatment given here is the same as in (4), some variation in the 
details of the method has proved to be necessary, and some of the final results 
are slightly weaker in form. 

As in (4) the key result is an approximation theorem for piecewise analytic 
curves on a variety; this theorem is stated in § 2 and then proved in §§ 3 
and 4. In § 5 the approximation theorem is applied to the discussion of the 
sheets, that is, the maximal analytically connected sets, of a real analytic 
variety. 

As regards further literature on the subject of real analytic varieties, see 
Whitney (5 and 6); in the former paper an approximation theorem of the 
type just mentioned is proved for analytic manifolds (that is, varieties without 
singularities), while in the latter certain decomposition theorems are obtained 
for a wide class of varieties. 


1. Real analytic varieties. In this paper the term real analytic variety 
will be applied to a set V in a fixed Euclidean space E_n such that V is closed
in E_n and each point p of V has a neighbourhood U in the ambient space such
that U ∩ V is the set of zeros of a finite collection of functions analytic in
U. Thus the term is equivalent to "sous-ensemble analytique" as in (2). At
each point p of V, V defines a germ of a real analytic variety V_p. Write V_p′
for the complexification of V_p, that is to say, the smallest germ of a complex
analytic variety containing V_p and contained in the complex n-space obtained
by allowing the co-ordinates in E_n to take complex values. The dimension
of V_p′ (that is to say, the dimension of the highest dimensional component
of V_p′) will be called the local dimension of V at p, to be written as dim_p V.
dim_p V has a maximum (≤ n) over all points p of V; this maximum will be
called dim V.


Received November 3, 1958. 
















A regular or simple point of V is a point p at which dim,V = dimV and at 
which local analytic co-ordinates can be set up in E_n in such a way that V
has locally the equations x_{r+1} = x_{r+2} = ... = x_n = 0, where r = dim V. A singular point of V
is a point which is not regular; note that this includes any point where the 
local dimension is less than the maximum for V. The set of all singular points 
of V will be called the singular locus of V. 

It is essential at this point to note that the singular locus of a real analytic 
variety is not necessarily an analytic variety. For example, consider the 
analytic variety defined in E_3 by the single equation x²y³ − z²(y + z) = 0.
The cross-section of this surface by a plane y = constant is a cubic curve
with a loop, and as y tends to zero this loop flattens out into the line segment
on the x-axis joining the points x = ± 2/(3√3). It is then easy to check that
the singular locus of this variety consists of this line segment along with the 
whole y-axis, and this set is certainly not an analytic variety. Of course the 
surface under consideration could be regarded as a real algebraic variety, in 
which case the whole of the x-axis would be included in the singular locus, 
and this locus would be an algebraic sub-variety. The essential difference is 
that for real analytic varieties the regularity or otherwise of a point is deter- 
mined by local equations, and not, as in the algebraic case, by global equations 
(which in general do not exist in the analytic case; cf. (2)). 
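
The flattening of the loop can be checked numerically. The sketch below assumes that the example surface reads x²y³ − z²(y + z) = 0 (a reading of the printed equation, not a certainty) and that the loop of the cross-section y = constant > 0 is the part with −y ≤ z ≤ 0; the function name and the sampling grid are choices made for the illustration.

```python
import math

# Sketch only: assumes the surface x^2 y^3 - z^2 (y + z) = 0 and y > 0.
# On the loop, -y <= z <= 0 and x^2 = z^2 (y + z) / y^3.

def loop_half_width(y, samples=3000):
    """Largest |x| on the loop of the cross-section at height y (grid search in z)."""
    best = 0.0
    for k in range(samples + 1):
        z = -y * (1.0 - k / samples)                    # z runs from -y to 0
        x_squared = z * z * max(y + z, 0.0) / y ** 3    # clamp guards against rounding
        best = max(best, math.sqrt(x_squared))
    return best

limit = 2.0 / (3.0 * math.sqrt(3.0))
for y in (1.0, 0.1, 0.001):
    print(y, loop_half_width(y), limit)
```

Under this reading the half-width of the loop in the x-direction is the same for every y > 0, namely 2/(3√3), attained at z = −2y/3, while the extent of the loop in the z-direction is y. This is consistent with the loop flattening onto the stated segment of the x-axis as y tends to zero.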

Another feature of the example just given concerns the approximation of 
analytic arcs (definition at the beginning of § 2) on the surface V with the 
equation x²y³ − z²(y + z) = 0. Take two regular points of V on the x-axis
and on opposite sides of the origin, say the points (±1, 0, 0), and call them
p and q. The segment pq of the x-axis is an analytic arc joining p and q but
there is no other arc joining these points in such a direct manner. In fact if C
is any other arc on V joining p and q and if K is its projection on the (x, y)-
plane, then the part of K lying in the strip defined by |x| < 2/(3√3) is covered
three times by part of C. It follows, for example, from this that an approxi- 
mation of K, however good, cannot be lifted to an approximation of C in V 
in the manner of (4). The trouble is that in (4) the success of the method 
used depended on the fact that the arcs studied never had more than 
finitely many points in common with the singular locus of the variety. But, as 
can be seen from the present example, an arc on a real analytic variety can 
have a sub-arc in common with the singular locus, even when the end-points 
are regular. 

The example just given indicates that, in order to study properties of real 
analytic varieties analogous to those studied in (4) for algebraic varieties, a 
weaker form of approximation for curves will have to be used, in which the 
approximation C’ of a given curve C will lie in a preassigned neighbourhood 
of C, but C’ will be mapped on C by a mapping which may be (at least along 
certain arcs) many-one. 


2. Statement of the approximation theorem. For convenience some
of the definitions of (4) will be repeated here. An analytic arc in Euclidean
n-space E_n is an arc given by parametric equations x_i = f_i(t), i = 1, 2, ..., n,
where the f_i are real analytic functions of t. The end-points of such an arc
are assumed to be non-singular; that is to say, the given equations define a
simple linear branch at each end-point. A piecewise analytic curve is a union
of finitely many analytic arcs joined end to end, in such a way that at a
common end-point P of two of the arcs, say C_1 and C_2, exactly these two arcs
meet and no others, and C_1 and C_2 have distinct tangents at P. Each point P
of the type described is called a joint of the curve. A curve C′ is said to be
an ε-approximation of a curve C if there is a homeomorphism f of C′ on C
such that the distance of p from f(p) is less than ε for each p. C′ and C are
said to be analytically equivalent at p if f(p) = p, and if, in a sufficiently
small neighbourhood U of p, there is defined a mapping T of the form
T(x_i) = x_i + h_i(x), where the h_i are real analytic functions of x_1, x_2, ..., x_n
at p, such that T(C ∩ U) = C′ ∩ U. Here the h_i, expanded in power series
at p, are assumed to be of order ≥ 2; if they are of order ≥ r, the analytic
equivalence is said to be of order ≥ r.
One of the main results of (4), which will be required here is: 


LEMMA 2.1. Let C be a piecewise analytic curve in E_n and let S be a finite set
of points on C including all singular points of C (note that the joints of C are
not to be counted as singularities of C). Then for any preassigned ε and r there
is an analytic curve C′ which is an ε-approximation of C with analytic equivalence
of order ≥ r at each point of S.


For any set A in E_n the ε-neighbourhood of A is the set of points in the
union of all spheres of radius ε with centres at the points of A. With this
terminology the approximation theorem to be proved in this paper can be 
stated. 


THEOREM 1. Let V be a real analytic variety in E_n and let C be a piecewise
analytic curve on V, all the joints of C being regular on V. Then for any
preassigned ε there is an analytic arc C′ on V and contained in the ε-neighbourhood
of C. In particular the end-points of C′ are within ε-neighbourhoods of those
of C, and if C is closed so is C′.


The idea of the proof of this theorem is as follows. If dim V = r, C will be
projected on a suitable r-dimensional linear subspace E_r of E_n. Writing the
projected curve as K, Lemma 2.1 will be applied to give an approximation 
K’ of K. K’ is then to be lifted into V. In the case of a real algebraic variety 
V (or more generally a real analytic variety given by global equations, with 
singular points defined globally as in algebraic geometry), this lifting is in 
general carried out by a one-one correspondence, yielding the stronger approxi- 
mation theorem of (4). Here, however, the lifting has to be done by means 
of the local equations of V, splitting K’ into a sequence of arcs for the purpose, 
and lifting each one in turn. The lifted arcs are then to be strung together to 













give the required curve C’. As illustrated in the example of § 1 the sequence 
of lifted arcs may double back on itself covering parts of K’ several times. It 
has to be checked of course that C’ cannot break up into closed loops, or the 
last condition to be proved in the theorem would not hold. 

Most of the proof is taken up with the process of choosing a set of co-ordi- 
nates so that the projection just referred to is the orthogonal projection onto 
the space x_{r+1} = x_{r+2} = ... = x_n = 0. The method is to take a point p on
C, and using local equations for V in a neighbourhood U(p) of p, to make a 
list of the various conditions unfavourable to the projection, approximation 
and lifting process described above. It turns out that, if co-ordinates are to 
be changed by orthogonal transformations, then the choices which are un- 
favourable, in U(p), correspond to an analytic subvariety in the space of 
orthogonal transformations. Since the curve C can be contained in a finite 
number of neighbourhoods of the type U(p) it follows that a choice of co- 
ordinates can be made which is favourable for the whole of C. 


3. Choice of co-ordinates. The procedure sketched at the end of the 
last section will now be carried out in detail. Take a point p of V as origin. In
a neighbourhood U(p) of p, V is defined as the set of zeros of a finite number
of power series in x_1, x_2, ..., x_n with real coefficients, convergent in U(p).
Alternatively, V is the set of real zeros of an ideal J in the ring of power
series in x_1, x_2, ..., x_n with real coefficients convergent near p. Write V′ for
the complexification of V at p, that is to say, the smallest complex analytic
variety, defined in a complex neighbourhood U′(p) of p (obtained by allowing
all the co-ordinates to assume complex values) whose set of real points
coincides with V ∩ U(p). If J is the ideal of V at p then the ideal of V′ is generated
by J in the ring of power series with complex coefficients convergent around p.

Take an irreducible component V_0 of V ∩ U(p) with dim V_0 = dim V,
provided such a component exists. In the applications to follow this choice
will always be possible; but to cover the contrary case, if dim_p V ≠ dim V, no
restriction will be imposed on the choice of co-ordinates around p. The
co-ordinates are now to be changed in such a way that the prime ideal of V_0 at p
becomes a regular ideal (1, p. 208). This means that, in the new co-ordinates
y_1, y_2, ..., y_n, the ideal is to contain no power series independent of y_{r+1},
y_{r+2}, ..., y_n but for each h > r it must contain a power series regular with
respect to y_h, that is to say, having an exact power of y_h among its terms
of lowest order. The method of regularizing an ideal is explained in (1, p. 208,
Theorem 4). It involves a sequence of linear changes of co-ordinates, which 
can in fact be taken to be orthogonal. And at each stage the condition that 
a change of co-ordinates should not be suitable is that the elements of the 
corresponding matrix should satisfy certain algebraic equations. However, 
the procedure followed in (1) is not quite suitable for the present purpose. 
For there the discussion is carried out in terms of formal series, after which 
a check has to be made as to the region of convergence. This region may 










well depend on the particular choice of co-ordinates made, whereas here it is 
necessary to work in a sequence of steps, making sure that at each there is a 
region of convergence independent of the co-ordinates chosen. The following 
is a variant of the method of (1) designed to meet this requirement. 


LEMMA 3.1. Let C be a compact set of the real analytic variety V. Then
co-ordinates can be chosen, making an orthogonal linear transformation from those
originally given, such that, in a neighbourhood of each point p of C the
co-ordinates of points on V satisfy a polynomial equation in x_n with coefficients analytic
in x_1, x_2, ..., x_{n−1} at p.


Proof. The set C used here will eventually be an analytic curve on V, but 
for the moment compactness is the only property wanted. Let p € C, and 
for convenience shift the origin of the given co-ordinates x_1, x_2, ..., x_n in
E_n to p. In a neighbourhood U(p) of p the points of V are the zeros of an
ideal J in the ring of power series in the x_i with real coefficients convergent
around p. Take a series f in J and assume U(p) is such that f is convergent
in U(p). Let A = (a_ij) be a generic orthogonal n × n matrix and define the co-ordinates
y_i by y_i = Σ_{j=1}^n a_ij x_j. Clearly there is an algebraic subvariety W(p) of the
set O_n of orthogonal n × n matrices such that any specialization of A not
in W(p) will give a set of co-ordinates y_1, ..., y_n such that f is regular with
respect to y_n. Now, by the Weierstrass preparation theorem, f can be
multiplied by a power series in the y_i, not vanishing at p, to give a polynomial g
in y_n whose highest coefficient is 1 and whose other coefficients all are power
series in y_1, y_2, ..., y_{n−1} vanishing at p. Now the proof of the Weierstrass
preparation theorem (1) shows that all the series involved in the theorem 
are convergent in a smaller neighbourhood U’(p) than U(p), obtained in fact 
by reducing the bounds of the various co-ordinates by a factor which depends 
on the upper bound of f in U(p). Bearing in mind that the linear change of 
co-ordinates being made here is orthogonal, thus leaving spheres invariant, it 
follows that if U(p) is taken as a spherical neighbourhood, U’(p) can be taken 
as a smaller sphere whose radius is a fraction of that of U(p). The fraction 
depends on the upper bound of f in U(p), but does not depend on the particular 
choice of the orthogonal matrix A. C, being compact, can be covered by a 
finite number of neighbourhoods of the type U’(p), and so the union of the 
corresponding W(p) makes up an algebraic subvariety of the set O_n. If the
matrix A changing the co-ordinates from the x_i to the y_i is chosen not in this
subvariety it follows at once that the conditions of the lemma are satisfied 
by the new co-ordinates. 

Note that, as the conclusion of the lemma has been left, it is not necessarily 
true that the polynomial in y_n corresponding to some point p of C has its
coefficients vanishing at p, unless p happens to be one of the finite set of
points corresponding to the finite set of U′(p) covering C. However some
factor of this polynomial will satisfy this condition if the origin is shifted so
that y_n = 0 at p. On the other hand, even without taking any further steps













beyond the proof described above, the polynomial in y_n corresponding to
any point of C will always have the coefficient of the highest power of y_n
equal to 1.

The choice of co-ordinates made in the last lemma is to fix y_n once and for
all, subsequent changes affecting only the co-ordinates y_1, y_2, ..., y_{n−1}. With
the choice of co-ordinates just described, let p be any point of C, and repeat
the argument of Lemma 3.1, replacing the ideal J of that lemma by the
intersection of J with the ring of real analytic functions at p independent
of y_n. This argument shows that, for points of V in a neighbourhood of each
point of C, y_{n−1} satisfies a polynomial equation with coefficients analytic in
y_1, y_2, ..., y_{n−2} at p, in particular the coefficient of the highest power of y_{n−1}
being 1. Proceeding in this way step by step, the following result is obtained:


LEMMA 3.2. With the assumptions of the last lemma, there is a choice of
co-ordinates x_1, x_2, ..., x_n in E_n such that in some neighbourhood of each point
p of C, the co-ordinates of points on V satisfy a set of equations of the form:


(1) f_i(x_1, x_2, ..., x_{r+i}) = 0, i = 1, 2, ..., n − r,


where f_i is a polynomial in x_{r+i} with coefficients real analytic in x_1, x_2, x_3, ...,
x_{r+i−1} at p, the coefficient of the highest power of x_{r+i} being 1.


It is to be understood in this statement that the set of f; may change when 
the point p is changed. On the other hand, in the case which is to be considered
later, it will be true that dim_p V = dim V at each point p of C, and this will
be precisely the value of r for each set of equations of the type (1). 

A further adjustment to the co-ordinate system is necessary to enable the
local equations of V to be brought into a certain canonical form. Let p be a
point of V at which dim_p V = dim V = r, and let V_0 be an r-dimensional
component of V in a neighbourhood of p. Taking p as origin let R_n denote the
ring of power series in x_1, x_2, ..., x_n with real coefficients convergent around
p and let J be the ideal of V_0 in this ring. In the residue class ring R_n/J let
ξ_i be the residue class of x_i, for each i. Then equations (1) are satisfied with
(x_1, x_2, ..., x_n) replaced by (ξ_1, ξ_2, ..., ξ_n), from which it follows at once that
the quotient field of R_n/J is a finite algebraic extension of that of R_r, the ring
of convergent power series in ξ_1, ξ_2, ..., ξ_r with real coefficients. In addition,
the dimensional condition imposed at p implies that the ξ_i for i = 1, 2, ..., r
are independent indeterminates over the real numbers. The object of the next
bit of working is to pick out a primitive root for this extension, and then to 
change the co-ordinates so that this root will be the residue class of one of 
the co-ordinates. That this can be done locally at the point p is, of course, a 
well-known result. But here the idea is to make the choice of co-ordinates in 
such a way as to bear the relation just described to each component of V in 
some neighbourhood of each point of a compact subset C. 

Returning now to the notation just introduced, note that the ξ_{r+i} for
i = 1, 2, ..., n − r are integral over R_r. Let u_1, u_2, ..., u_{n−r} be independent
indeterminates over the quotient field of R_r and write R_r′ = R_r[u_1, u_2, ..., u_{n−r}].
Then clearly

ξ = Σ_{i=1}^{n−r} u_i ξ_{r+i}

is integral over R_r′. Thus ξ will satisfy an equation of the type:
(2) ξ^k + a_1 ξ^{k−1} + ... + a_k = 0,


where each of the a_i is in R_r′. As regards the convergence conditions, each of
the a_i is a polynomial in the u_j with coefficients in R_r, and so only finitely
many power series are involved in the equation (2), each series being
convergent for sufficiently small values of the ξ_i. (That these series are
convergent at all can be seen, for example, from the fact that equation (2) can
be derived by rational processes from equations satisfied by the individual
ξ_{r+i}, i = 1, 2, ..., n − r, such as the equations (1), where convergence is
known. Note that so far nothing is said or known about the reducibility or
otherwise of (2).) Now the theorem of the primitive root for a finite algebraic
extension (3) says that, provided the u_i do not satisfy a set of linear equations
(which they do not, being independent indeterminates), ξ is a primitive root
for the quotient field of R_r′(ξ_{r+1}, ξ_{r+2}, ..., ξ_n). This means in particular that

(3) ξ_{r+i} = F_i(u, ξ_1, ξ_2, ..., ξ_r, ξ)/G(u, ξ_1, ξ_2, ..., ξ_r)

for each i = 1, 2, ..., n − r, where each F_i is a polynomial in the u_j and ξ,
with coefficients in R_r, and G is a polynomial in the u_j with coefficients in
R_r. Then in all the equations (3) there are only finitely many power series in
ξ_1, ξ_2, ..., ξ_r, all convergent for sufficiently small values of the ξ_i. Identifying
ξ_i with x_i for each i = 1, 2, ..., r, it follows that there is a neighbourhood
U(p) of p in which all the series (in x_1, x_2, ..., x_r) appearing in the equations
(2) and (3) are convergent. Also denote by L(p) the set of linear equations
in the u_i which must not be satisfied if ξ is to be a primitive root as just
described. Then if C is a compact set on V it can be covered by a finite number
of neighbourhoods of the type U(p), and the u_i can be given real values such
that none of the corresponding sets of equations L(p) are satisfied, and such
that the rational functions appearing in the equations of the type (3)
corresponding to each of these neighbourhoods are defined. At least one of the
u_i will be non-zero, say u_1. Then take as a new set of co-ordinates in E_n

x_1, x_2, ..., x_r, Σ_{i=1}^{n−r} u_i x_{r+i}, x_{r+2}, ..., x_n.

Changing the notation so that these co-ordinates are again written as x_1, x_2,
..., x_n the result obtained can be summed up as follows:


LEMMA 3.3. Let C be a compact subset of the real analytic variety V such that,
at each point p of C, dim_p V = dim V = r. Then co-ordinates in E_n can be chosen
in such a way that C is covered by a finite number of neighbourhoods in each of
which the points of each r-dimensional local component of V satisfy equations
of the form


(4) F(x_{r+1}) = 0,
    x_{r+i} = F_i(x_{r+1})/G, i = 2, 3, ..., n − r,


where F and the F_i are polynomials in x_{r+1} with coefficients which are power
series in x_1, x_2, ..., x_r, and G is a power series in x_1, x_2, ..., x_r, all the series
being convergent in the relevant neighbourhood.


(It is assumed in speaking of these power series that the origin has been shifted 
to a certain point of the neighbourhood in question.) 

Finally take any point p on the compact set C as origin. p will lie in some 
neighbourhood of the covering of C described in the last lemma, and so the 
points of the r-dimensional components of V at p will satisfy equations of 
the type (4), with G and the coefficients of the F_i and of F real analytic
at p. The irreducible factors of F corresponding to these components can now
be picked out, and will have coefficients which can be written as power series
in x_1, x_2, ..., x_r convergent in some neighbourhood of p; the co-ordinates
x_{r+2}, x_{r+3}, ..., x_n for points of these components will still be given by (4),
convergence of the series involved holding in some neighbourhood of p. These
remarks enable a refinement of Lemma 3.3 to be stated:


LEMMA 3.4. C being as in the last lemma, each point p of C has a neighbourhood
U(p) in which each r-dimensional component of V is exactly the set of points
satisfying equations of the type (4) with F and the $F_i$ polynomials in $x_{r+1}$ and
(with p as origin) G and the coefficients of F and the $F_i$ power series in
$x_1, x_2, \ldots, x_r$ convergent in U(p). In addition, C, being compact, can be covered
by a finite number of neighbourhoods of the type U(p).


At this stage it is convenient to make a definition in preparation for the 
next section. In the notation of the last lemma, cover C by a finite number of 
the neighbourhoods U(p), and set up equations of the type (4) for each 
r-dimensional component of V in each such neighbourhood. Let D be the 
discriminant of F; it will be a series convergent in U(p). Then the set of points
in $(x_1, x_2, \ldots, x_r)$-space defined by G = D = 0 is a local analytic variety in
the projection of U(p). The union of all the local varieties obtained in this
way from all the r-dimensional local components of V in all the U(p) of the
finite covering of C described in Lemma 3.4 will be called the branch locus
of V relative to the sets of local equations described in that lemma (or simply
branch locus if the context is clear). For brevity a set of points in a Euclidean
space which, like the branch locus just introduced, is the union of a finite
number of local analytic varieties, each defined in some neighbourhood, will
be called an open variety.
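As an illustration only (it is not taken from the paper), equations of the type (4) and the corresponding branch locus can be written out in a small assumed case: r = 1, n = 3, and V the curve given parametrically by $x_1 = t^2$, $x_2 = t^3$, $x_3 = t$. The names $F_2$ and the choice of this particular curve are assumptions made for the sketch.

```latex
% Assumed example: V is the curve x_1 = t^2, x_2 = t^3, x_3 = t in E_3 (r = 1, n = 3).
\begin{align*}
F(x_2) &= x_2^2 - x_1^3 = 0, \\
x_3 &= F_2(x_2)/G, \qquad F_2(x_2) = x_2, \quad G = x_1, \\
D &= \operatorname{disc}_{x_2}\bigl(x_2^2 - x_1^3\bigr) = 4 x_1^3 .
\end{align*}
% G and D both vanish exactly at x_1 = 0, the point of the x_1-axis over which
% the two real roots x_2 = \pm x_1^{3/2} of F come together.
```

In this sketch the branch locus in $x_1$-space is the single point $x_1 = 0$, and away from it each root of F determines $x_3$ rationally.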


4. Displacement of an arc from an open variety. As already pointed 
out, one of the difficulties presented by real analytic varieties is that a curve,
although not lying entirely in the singular locus, may nevertheless have an 
arc in common with that locus. The lemma about to be proved is the main 
step towards resolving this difficulty. 


LEMMA 4.1. Let V be an open variety in Euclidean n-space $E_n$, and let C be
an analytic arc contained in V. Then there exists in $E_n$ an analytic arc C’, which
is an arbitrarily good approximation of C and which meets V only at finitely
many points.


Proof. Clearly no generality is lost by assuming that all the components
of the various local analytic varieties of which V is composed are of dimension
$n - 1$; this can always be arranged if necessary by enlarging V. The
discussion of the last section can now be applied to V. The argument is not
affected by the fact that V is now an open variety rather than a real analytic
variety, since at each stage only local properties are used. It then follows
that co-ordinates can be chosen in such a way that C is covered by a finite
number of neighbourhoods in each of which V is given by equations of the
type (4). In this case these equations reduce to a single polynomial equation
in $x_n$ with coefficients which are power series in the remaining variables
convergent in the neighbourhood in question, a point in that neighbourhood
being taken as origin. Let the parametric equations of C be $x_i = f_i(t)$,
$i = 1, 2, \ldots, n$, where the $f_i$ are real analytic functions of t, and t varies
over some finite interval, say $0 \le t \le 1$. Then there is an analytic mapping
of the $(t, x_n)$-plane into $E_n$ given by $f(t, x_n) = (f_1(t), f_2(t), \ldots, f_{n-1}(t), x_n)$. The
image of f in $E_n$ is a piece of analytic surface S containing C, and in particular
C is the image of the curve $\bar{C}$ in the $(t, x_n)$-plane with the equation $x_n = f_n(t)$.
If $F(x_1, x_2, \ldots, x_n) = 0$ is the equation defining one of the local varieties
of which V is composed, then the equation

$$F(f_1(t), f_2(t), \ldots, f_{n-1}(t), x_n) = 0$$

defines a local variety in a neighbourhood of some point of the $(t, x_n)$-plane,
and the union of all the local varieties obtained in this way forms an open
variety in this plane. Denote this variety by $\bar{V}$. It is clear that the image
of $\bar{V}$ under f is the intersection of S and V, and, moreover, the points of $\bar{V}$
are the only ones mapped into $S \cap V$. And so if the lemma can be proved
for the curve $\bar{C}$ relative to the open variety $\bar{V}$ in the $(t, x_n)$-plane, giving an
approximation $\bar{C}'$ of $\bar{C}$ meeting $\bar{V}$ at only finitely many points, then the curve
$C' = f(\bar{C}')$ will satisfy the requirements of the lemma. In order to prove the
lemma in the plane it is clearly sufficient to take $\bar{C}'$ to be any sufficiently
good approximation of $\bar{C}$ in the plane. But in view of the applications to be
made of this result, it is necessary to make the approximation in a particular
way, as will now be explained. The open variety $\bar{V}$ consists of a finite number
of curve branches, each defined in some neighbourhood; let the finite collection
of points $p_i$ denote the centres of these branches along with a finite set of
points arbitrarily chosen on $\bar{C}$. Let $t_i$ be the value of t at $p_i$. Let g(t) be a
polynomial in t vanishing only at the $t_i$, to an order at least r. For example
$g(t) = \prod_i (t - t_i)^r$ will do. Then, remembering that $\bar{C}$ has the equation $x_n = f_n(t)$,
define $\bar{C}'$ as the curve with equation $x_n = f_n(t) + \lambda g(t)$, where $\lambda$ is a real
parameter. Note that, in a neighbourhood of each $p_i$, the power series expressions
of the parametric equations of $\bar{C}$ and of $\bar{C}'$ differ only by high powers of
the parameter if r is taken large, and also that, as the parameter $\lambda$ tends to
zero, the approximation of $\bar{C}$ by $\bar{C}'$ can be made arbitrarily close. Since one
of the branches making up $\bar{V}$ at each $p_i$ is part of $\bar{C}$, and since $\bar{C}$ and $\bar{C}'$ meet
only at the $p_i$, it is not hard to see that, for $\lambda$ small enough, $\bar{C}'$ will meet $\bar{V}$
only at the $p_i$. Applying the mapping f to the curve $\bar{C}'$ constructed in this
way, the following corollary of the above lemma is obtained:


COROLLARY. In the above lemma, C’ can be constructed in such a way that, at
each of the finitely many points where it meets V, each of its branches is associated
with a branch of C, and, with a suitable choice of parameter, the parametric
equations of these branches differ only by terms of high order with coefficients
depending analytically on a parameter $\lambda$ and tending to zero as $\lambda$ tends to zero.
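The displacement used in the proof of Lemma 4.1 can be made concrete in the simplest planar case. The following is an assumed example, not one from the paper: the open variety in the $(t, x_n)$-plane is taken to be the line $x_n = 0$, the plane curve (written $\bar{C}$ here) is the segment of that line over $0 \le t \le 1$, the marked parameter values are $t_1 = 0$ and $t_2 = 1$, and the order of vanishing is r = 2.

```latex
% Assumed data: the variety is the line x_n = 0, and the arc lies entirely in it.
\begin{align*}
\bar{C} &: \; x_n = f_n(t) = 0, \qquad 0 \le t \le 1, \\
g(t) &= t^2 (t - 1)^2, \\
\bar{C}' &: \; x_n = f_n(t) + \lambda g(t) = \lambda\, t^2 (t - 1)^2 .
\end{align*}
% For \lambda \neq 0 the displaced curve meets the line x_n = 0 only at t = 0 and t = 1,
% and \max_{0 \le t \le 1} |\lambda g(t)| = |\lambda| / 16, which tends to zero with \lambda.
```

Near each marked point the two curves differ only by terms of order at least 2 in the local parameter, which is the feature the corollary records.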


The proof of Theorem 1 can now be undertaken, the following lemma giving 
the discussion of the most difficult step in the proof. 


LEMMA 4.2. Let V be a real analytic variety in $E_n$ and let C be an analytic
arc on V with at least one of its end-points regular on V. Let $C_0$ be the set of
points on C for which the local dimension of V is r = dim V, and let co-ordinates
be chosen in $E_n$ in accordance with Lemma 3.4, relative to the compact set $C_0$
on V. Let B be the branch locus in $(x_1, x_2, \ldots, x_r)$-space $E_r$ corresponding to
this co-ordinate choice, and to the choice of local equations for V around a finite
number of points of $C_0$. Then in the $\epsilon$-neighbourhood of C on V, for preassigned $\epsilon$,
there is an analytic arc C’ whose projection in $E_r$ meets B at only finitely many
points. And in particular the end-points of C’ will lie in $\epsilon$-neighbourhoods of
those of C.


Proof. Let K be the projection of C in $E_r$. If $K \cap B$ already consists of
finitely many points there is nothing to be done, for C’ can be taken equal to
C. But it may be that subarcs of K lie in B. In this case a set of points $p_i$ is
to be defined on C, along with their projections $q_i$ on K (the $q_i$ being not
necessarily all distinct). The $q_i$ are to include all singularities of K and all
isolated points of $K \cap B$, and the $p_i$ are to include all points of C projecting
on these. Using local equations of the type (4) for V in neighbourhoods of a
finite number of points of C, it may turn out that certain arcs on K can be
lifted to give several copies on V, apart from those which form part of C. The
$p_i$ are to include the intersections of C with these other copies (these points
will clearly be finite in number) and the $q_i$ are to include their projections.
Finally, approximate K by an arc K’ in $E_r$ as in Lemma 4.1 and its corollary.
According to these results $K' \cap B$ consists of finitely many points, and it is
clear from the proof of the corollary that these can be assumed to include
the $q_i$ already defined. Any additional intersections of K’ and B are now to
be included also among the $q_i$, and as before all points of C projecting on
them are to be taken among the $p_i$. It will be remembered that according to
the corollary to Lemma 4.1, K’ depends on a parameter $\lambda$ and that, around
the $q_i$, K’ takes the limiting position K as $\lambda$ tends to zero. It is now to be
shown that, if the approximation of K by K’ is close enough and if $\lambda$ is small
enough, there is an arc C’ on V satisfying the requirements of this lemma
and projecting on K’.

To establish the existence of this arc C’, sets of neighbourhoods covering
C and K will now be constructed. K’ will then be made to lie in the union
of these neighbourhoods, and will be divided into arcs each lying in one of
the neighbourhoods. These arcs will then be lifted into V, and it will be shown
that some of them can be joined end to end to form the required analytic
arc C’. First define $U(q_i)$ as a neighbourhood of $q_i$ such that the parametric
equations of each branch of K’ at $q_i$, when expressed in terms of a suitable
parameter, give each co-ordinate as a power series convergent in $U(q_i)$, and
in fact uniformly convergent with respect to $\lambda$, and also such that, if these
equations are substituted into the equations of the type (4) for V around $p_i$,
the resulting equations can be solved for $x_{r+1}, x_{r+2}, \ldots, x_n$ as fractional power
series in the parameter, convergent for values of the parameter corresponding
to points of K’ in $U(q_i)$. If several $q_i$ coincide, take $U(q_i)$ as the smallest
of the corresponding neighbourhoods. $U(p_i)$ is then to be a neighbourhood
of $p_i$ projecting onto $U(q_i)$. In addition $U(p_i)$ is assumed to be taken so
small that no two curve branches on V at $p_i$ projecting into branches of K’
at $q_i$ have any point in common other than $p_i$. It should be noted that, as
the results of § 3 are being used here, what has just been said makes sense
only around the points of $C_0$ and their projections. It will appear presently,
however, that $C = C_0$. The curves C, K, K’ will now be split up into a number
of arcs, for which it is convenient to introduce some terminology. These
arcs will be called C-arcs, K-arcs, or K’-arcs according to the curve they lie
on. C-arcs lying in the neighbourhoods $U(p_i)$ will be said to be of the first
kind. When these arcs are removed the remainder of C consists of finitely
many disjoint non-singular arcs. Those whose projections are contained in B
will be said to be of the second kind, and those whose projections do not
meet B of the third kind. There is a certain amount of arbitrariness in the
definition of B, depending as it does on the choice of particular neighbourhoods;
it is not hard to see that, by shrinking certain of these neighbourhoods
if necessary, it can be arranged that the whole of C is a union of arcs of the
three kinds described. The projections of these arcs will be called K-arcs of
the first, second, and third kinds, respectively.

Construct now an open covering of C by sets in V. Let $\gamma$ be a C-arc of the
second or third kind, and let $\kappa$ be its projection in $E_r$. Let $U(\gamma)$ be a neighbourhood
of $\gamma$ not meeting C except in the points of an arc obtained by
extending $\gamma$ slightly in each direction (not far enough to reach any of the
$p_i$). Also $U(\gamma)$ is to be chosen so that its projection $U(\kappa)$ meets K in the
arc $\kappa$ slightly extended at each end. The set of $U(p_i)$ and $U(\gamma)$ covers C while
the $U(q_i)$ and $U(\kappa)$ cover K. Now in each $U(q_i)$ there will be a set of branches
of K’ approximating K-arcs of the first kind, and if the approximation of K
by K’ is sufficiently close each $U(\kappa)$ will contain exactly one non-singular
arc of K’ approximating $\kappa$. These arcs of K’ lying in the $U(q_i)$ and the $U(\kappa)$
will be called K’-arcs of the first, second, or third kind according to the kind
of K-arc they approximate.

C’-arcs, some of which will make up the required curve C’, will now be
defined. To obtain a C’-arc of the first kind, take a K’-arc $\kappa'$ of the first kind,
through $q_i$, say, and change the parameter on it so that it is zero at $q_i$. Substitute
these parametric equations into the appropriate equations of the type
(4) for V, and calculate all the corresponding roots $x_{r+1}$ as fractional power
series in the parameter t. By the choice of the $U(p_i)$ and $U(q_i)$ convergence
holds for these series for all t corresponding to points on $\kappa'$, and the resulting
curve branches in $E_n$ will all actually lie in $U(p_i)$. If the fractional power
series corresponding to one of these branches involve an even root of t, the
end-points of that branch will lie in the same set $U(\gamma)$ for some $\gamma$ whenever
the parameter $\lambda$ on which K’ depends is taken small enough. On the other
hand, if an odd root of t is taken, the end-points of the branch will lie in
two different $U(\gamma)$’s for $\lambda$ small enough. The set of all branches on V obtained
in this way will be called C’-arcs of the first kind. Consider now a K’-arc $\kappa'$
of the second kind approximating a K-arc $\kappa$. $\kappa'$ is to be lifted into V in a
similar way to that applied to the arcs of the first kind. $\kappa'$ in this case may
pass through several neighbourhoods in each of which a different set of equations
of the type (4) for V must be used, and so $\kappa'$ must be lifted in sections.
A number of copies will be obtained of $\kappa'$ lifted in this way into V, and, taking
$\kappa'$ to be a compact arc, it is clear that the points of certain of the lifted arcs
will converge to the points of some C-arc lying over $\kappa$, as the parameter $\lambda$
on which K’ depends tends to zero, and the convergence is uniform with
respect to the variable point on $\kappa'$. If $\gamma'$ is one of these lifted arcs, converging
to the arc $\gamma$ as $\lambda$ tends to zero, $\gamma'$ will lie in $U(\gamma)$ for $\lambda$ sufficiently small. Assume
now that $\lambda$ is so small that this happens for all arcs like $\gamma'$ obtained in this
way; these arcs will be called C’-arcs of the second kind. C’-arcs of the third
kind are obtained in the same way from the K’-arcs of the third kind. The
only difference is that in this case there is a unique C’-arc over a given K’-arc.

Suppose that the family of C’-arcs has been constructed and that $\lambda$ is so
small that the conditions mentioned relative to the arcs of the second and
third kinds and the end-points of those of the first kind are satisfied. A maximal
C’-arc can now be defined as a maximal union of C’-arcs which is itself an
analytic arc. Since two distinct analytic arcs cannot have a subarc in common,
it is clear that each C’-arc belongs to exactly one maximal arc, and also the
C’-arcs forming a maximal arc follow one another in a well-defined sequence
(defined by the variation of an analytic parameter on the maximal arc) in
which each C’-arc is traced out exactly once, unless the maximal arc is closed.
Now one end-point p of C is assumed, in the hypothesis of the lemma, to
be regular on V. If $\gamma_1$ is the C-arc on which p lies, it follows that there is
exactly one C’-arc $\gamma_1'$ which has $\gamma_1$ as its limit when $\lambda$ tends to zero, and
exactly one point p’ of that arc will have p as limit. The last statement rules
out the possibility that $\gamma_1'$ is an arc of the first kind with parameter obtained
from that of $\gamma_1$ by taking an even root. Now define C’ to be the unique maximal
C’-arc starting off with $\gamma_1'$. C’ is certainly not closed, for it has p’ as an end-point.
The other end of $\gamma_1'$ will be joined to a second C’-arc $\gamma_2'$, that to a third
$\gamma_3'$, and so on until the other end of C’ is reached. This must happen after a
finite number of steps, since there are only finitely many C’-arcs, no one of
which can be used twice. Also, it has been arranged that each C’-arc of the
second or third kinds will lie in one of the $U(\gamma)$ and so can be assumed to end
in one of the $U(p_i)$, so that it will join up to an arc of the first kind; and
similarly each C’-arc of the first kind will join up to one of the second or
third kind; all this with the exception of arcs which end near the end-points
of C. It follows that the second end-point of C’ will lie in a preassigned neighbourhood
of the second end-point of C, if $\lambda$ is small enough. Also, the definition
of the C’-arcs has ensured that, if $\lambda$ is small enough, C’ will lie in a preassigned
neighbourhood of C, and so the requirements of the lemma are satisfied by
C’, whose projection K’ meets B at only finitely many points.


COROLLARY. With the notations of the last lemma, $C_0$, the set of points on C
where the local dimension of V is r = dim V, coincides with C.


Proof. The proof of this is implicit in what has gone before. If the result
is not true there will be a point $p_0$ different from the initial point p of C such
that at all points of the arc $pp_0$ the local dimension of V is r, but following
$p_0$ and arbitrarily close to it there will be points of lower local dimension. $p_0$
will in this case necessarily be among the points $p_i$ defined above. Let $\gamma$ be
the C-arc which passes beyond $p_0$ into the region of lower local dimension,
$\kappa$ its projection, and $\kappa'$ the K’-arc which approximates it. Then every C’-arc
lying in $U(p_0)$ over $\kappa'$ must correspond to taking an even root of the parameter
on $\kappa'$, since the second half of $\kappa'$ has no points of V over it near $p_0$. It would
follow that C’ could not have a second end-point; for such an end-point could
not lie over any point of K’ preceding $q_0$, and yet it follows from what has
just been said that a moving point on C’, starting at p, must always double
back when it reaches $p_0$, always projecting on a point of K’ which precedes
$q_0$. But since there are only finitely many C’-arcs and C’ cannot be closed a
contradiction is thus obtained which proves the corollary.


The proof of Theorem 1 will now be completed. C is now to be a piecewise
analytic curve on V with all the joints at regular points of V. Co-ordinates
in $E_n$ are to be chosen as in Lemma 3.4, the curve C being taken as the compact
set; this choice of compact set is admissible in view of the corollary of
the last lemma which shows that the local dimension of V is r at all points
of C. Apply Lemma 4.2 to each of the analytic arcs of which C is composed.
The result is a collection of analytic arcs $C_i$ lying in a preassigned neighbourhood
of C, with end-points lying arbitrarily close to the end-points and
joints of C, and with projections in $E_r$ meeting the branch locus B in finitely
many points. In addition, since the joints of C are regular on V, it can be
assumed that the end-points of the $C_i$ lie in cellular neighbourhoods of these
points, and so they can be joined up by analytic arcs within these cells. And
these new analytic arcs will have projections in $E_r$ not meeting B. Thus, given
the piecewise analytic curve C, a new piecewise analytic curve $\bar{C}$ has been
constructed in a preassigned neighbourhood of C, with its end-points in
preassigned neighbourhoods of those of C, and such that the projection $\bar{K}$
of $\bar{C}$ in $E_r$ meets the branch locus B in at most finitely many points. Thus to
complete the proof of Theorem 1 it is only necessary to prove the theorem
for $\bar{C}$. And the major difficulty has now been removed, namely that caused
by subarcs of the given curve projecting into the branch locus. Using Theorem 3
of (4), quoted above as Lemma 2.1, let K’ be an approximation of $\bar{K}$, with
analytic equivalence at all singularities of $\bar{K}$ and at points of $\bar{K} \cap B$, but
smoothing $\bar{K}$ at the joints. In addition, the results leading to Theorem 3 of
(4) imply that K’ can be assumed to depend on a parameter $\lambda$ in such a
way that, as $\lambda$ tends to zero, K’ takes the limiting position $\bar{K}$ in a neighbourhood
of each of the singularities and points of $\bar{K} \cap B$. K’ is now to be
lifted into V to give the analytic curve C’ required by Theorem 1. The simplest
way of doing this is to repeat the argument of Lemma 4.2, with the simplification
here that there are no arcs of the second kind. Alternatively the lifting
can be done as in (4), using sets of local equations like (4) for V instead of
the global equation which was available here. Note that if this second method
is used, it is possible to make C’ and $\bar{C}$ analytically equivalent at the singularities
of the latter. This may be of interest if $\bar{C}$ happens to be the curve
which is given. However if the given curve is such that the initial adjustment
replacing C by $\bar{C}$ (to avoid having arcs projecting into B) is necessary, any
property of analytic equivalence is liable to be lost in the process. It is not
hard to see that this adjustment will be necessary if and only if C has arcs
in common with the singular locus of V, a situation which has been shown
to be possible by the example of § 1.

One further remark must be made concerning Theorem 1. It will be noticed
that the approximating curve C’ given by that theorem passes through all
the points $p_i$ constructed in the course of the proof of Lemma 4.2, and that
along with the points which must belong to this set any finite collection of
points on C can be included. This can be stated as a corollary which strengthens
the result of the theorem:


COROLLARY 2. In Theorem 1, C’ can be constructed so as to pass through each 
of a finite set of points arbitrarily given on C. In particular the end-points of C’ 
can be made to coincide with those of C.
5. Sheets of an analytic variety. Let V be a real analytic variety
in $E_n$. A subset S of V is said to be analytically connected if every pair of
points of S can be joined by an analytic arc lying in S. A sheet of V is an
analytically connected subset not contained in any larger analytically connected
subset of V. The sheet S is said to be proper if it contains a point p
with a neighbourhood U in $E_n$ such that $V \cap U = S \cap U$. This terminology
agrees with that of (4). The following results correspond to some of the
properties derived for sheets of real algebraic varieties in (4).


LEMMA 5.1. Let p, q, r be three points of a real analytic variety V and let q
be regular on V. Then, if there are analytic arcs on V joining p to q and joining
q to r, there is an analytic arc on V joining p to r and meeting a preassigned
neighbourhood of q.


Proof. The proof is as for Lemma 17.1 of (4), using here Theorem 1 and
its Corollary 2 to approximate the union of the given arcs pq and qr by an
analytic arc from p to r.


THEOREM 2. Let p be a regular point of the real analytic variety V and let S 
be the set of all points of V which can be joined to p by analytic arcs on V. Then 
S is a sheet of V, and every sheet of V containing a regular point can be con- 
structed in this way. 


Proof. The proof, using Lemma 5.1, is essentially the same as that of
Theorem 13 of (4).

The following two corollaries correspond similarly to the corollaries of
Theorem 13 in (4).

COROLLARY 1. Each regular point of V belongs to exactly one sheet.

COROLLARY 2. Each sheet of V containing a regular point of V is proper.


The next theorem corresponds to the dimensional homogeneity established 
in (4) for sheets of real algebraic varieties: 


THEOREM 3. Let S be a sheet of a real analytic variety V containing a regular 
point p of V. Then for any q on S, every neighbourhood of q contains a point q’ 
of S which is regular on V. 

Proof. Let C be an analytic arc joining p and q. Set up a co-ordinate system
as for Lemma 4.2 relative to C, and apply that lemma, along with Corollary
2 at the end of § 4. This gives an analytic arc C’ joining p and q and with
its projection K’ in $E_r$ meeting the branch locus B in only finitely many
points. In particular, C’ itself can meet the singular locus of V in at most
finitely many points, and so any neighbourhood of q must contain a regular
point q’ of V lying on C’. Since q’ is joined to the regular point p by an analytic
arc on V, Theorem 2 along with its first corollary implies that q’ is on S as
required.
A sheet S containing a regular point of V will be called r-dimensional. The 
local cellular decomposition described in § 18 of (4) carries over to the real 
analytic case, since the whole construction depends only on the setting up 
of local equations. The result is: 


LEMMA 5.2. Let p be a point of a real analytic variety V of dimension r and let
W be a subvariety containing p (W in fact need only be defined in a neighbourhood
of p). Then in any preassigned neighbourhood of p there is a neighbourhood
U of p such that $V \cap U$ is the union of the closures of a set of disjoint open
cells $U_i$ of dimensions $\le r$ such that:

(1) $\bigcup \mathrm{Fr}\, U_i = U \cap W'$, where W’ is an analytic subvariety of V, defined at
least around p, such that $U \cap W' \supset U \cap W$.

(2) Each r-cell in the decomposition of $V \cap U$ is contained in exactly one
proper sheet (note that here, unlike the algebraic case, this statement is only made
for the cells of highest dimension).

(3) $p \in \bar{U}_i$ for each i.

(4) The neighbourhood U of p can be chosen so that all points of $U \cap (V - W)$
can be joined to p by analytic arcs on V meeting W only at p.


To make the statement of part (2) of the above lemma complete, note 
that each s-cell (for s < r) in the decomposition described there consists of 
regular points of a real analytic variety of dimension s defined in a neigh- 
bourhood of p. Such a cell is thus analytically connected, and so is contained 
in a maximal analytically connected subset of V, namely a sheet. But a 
sufficiently small neighbourhood of a point of the cell in question will meet V 
only at points of the cell, and so the sheet so obtained is proper. A slight 
strengthening of this statement gives the following theorem: 


THEOREM 4. Every point of a real analytic variety V belongs to a proper 
sheet of V. 


Proof. Take any point p on V, and construct a cellular decomposition of V
around p as in Lemma 5.2. Let M be one of the cells, say of dimension $s \le r$.
As already pointed out, M is analytically connected. And so, by part (4) of
Lemma 5.2, the set consisting of M along with the point p is analytically
connected, and so is contained in some sheet. But a small neighbourhood of a
point of M meets V only at points of M, and so this sheet is proper.


It is worth noting that, in the notation of this theorem, the closure $\bar{M}$ of
the cell M is analytically connected. For M is part of a real analytic variety
$V_0$ defined at least in a neighbourhood of p. Now take two points $q_1$ and $q_2$
in $\bar{M}$. If they are in M then they certainly can be joined by an analytic arc
in M. Suppose $q_1 \in \mathrm{Fr}\, M$. Then, applying Lemma 5.2 to $V_0$ around $q_1$, it
follows that $q_1$ can be joined by an analytic arc $\gamma_1$ in $\bar{M}$ to a point $q_3$ of M,
and if $q_2 \in M$ then $q_3$ and $q_2$ can be joined by an analytic arc $\gamma_2$ in M. Then,
applying Lemma 5.1 to $V_0$ (the fact that $V_0$ may be only locally defined
makes no difference), it follows that the union of $\gamma_1$ and $\gamma_2$ can be replaced
by an analytic arc in $\bar{M}$ joining $q_1$ and $q_2$. A similar argument can be used
if both $q_1$ and $q_2$ are in Fr M.

The following result corresponds to Theorem 14 of (4); note, however, the 
restriction as to dimension. 


THEOREM 5. Let V be a real analytic variety in $E_n$, of dimension r. Then each
r-dimensional sheet of V is a closed set.


Proof. Let S be an r-dimensional sheet of V, and let p be in the closure $\bar{S}$
of S. Then a neighbourhood U of p contains a point q of S, which, by Theorem
3, can be assumed to be regular on V. By part (4) of Lemma 5.2, if U is small
enough p and q can be joined by an analytic arc on V, and so by Theorem 2
and its Corollary 1, p belongs to the unique sheet determined by the regular
point q, namely S. S is thus closed.


In contrast to the algebraic case, the properties of the lower dimensional 
sheets of a real analytic variety are somewhat elusive. For although such 
sheets are contained in the singular locus of V, this locus may not be an 
analytic variety. On the other hand, the points of V at which the local dimen- 
sion is less than the maximum form a subset of the singular locus and this 
subset is (as will be shown in a future paper) part of a real analytic sub- 
variety. Consequently, the results proved above will all extend to the proper 
sheets of V, regardless of their dimension. The proposed proof of the assertion 
just made depends on attaching some sort of multiplicity to each singular 
point of V. This is done with reference to local systems of equations. Then, 
attempting to build up a variety of singular points, one fits the p-fold locus
in one neighbourhood to the q-fold locus in an adjoining one, with p not
necessarily equal to q. p and q are necessarily both even or both odd, and
the process only breaks down if one of them is equal to 1. This cannot happen
if one has started at a point of lower local dimension, where the multiplicity 
attached is always even. 


REFERENCES 


1. S. Bochner and W. T. Martin, Several complex variables (Princeton, 1946).

2. H. Cartan, Variétés analytiques réelles et variétés analytiques complexes, Bull. Soc. Math.
France, 85 (1957), 77-99.

3. B. L. van der Waerden, Moderne Algebra (Berlin, 1931).

4. A. H. Wallace, Algebraic approximation of curves, Can. J. Math., 10 (1958), 242-278.

5. H. Whitney, Analytic coordinate systems and arcs in a manifold, Ann. Math., 38 (1937),
809-818.

6. H. Whitney and F. Bruhat, Quelques propriétés fondamentales des ensembles analytiques-réels,
Comm. Math. Helvet., 33 (1959), 132-160.


University of Toronto 











ON FINITE NILPOTENT GROUPS 


G. BACHMAN 




1. Introduction and notations. It is well known that if $(n, \phi(n)) = 1$,
where $\phi(n)$ denotes the Euler $\phi$-function, then the only group of order n is
the cyclic group. This is a special case of a more general result due to Dickson
(2, p. 201); namely, if

$$n = p_1^{a_1} \cdots p_t^{a_t},$$

where the $p_i$ are distinct primes and each $a_i > 0$, the necessary and sufficient
conditions that the only groups of order n are abelian are (1) each $a_i \le 2$
and (2) no $p_i^{a_i} - 1$ is divisible by any of $p_1, \ldots, p_t$.
We wish to establish a theorem which includes these two results. We let
G(n) equal the number of groups of order n where

$$n = \prod_{i=1}^{s} p_i^{a_i},$$

and we seek necessary and sufficient conditions on n so that

$$G(n) = \prod_{i=1}^{s} G(p_i^{a_i}).$$

Clearly, this problem is equivalent to finding necessary and sufficient conditions
on n so that all existing groups of order n be nilpotent.
It will be shown that the following is true:


THEOREM 1. Let

$$n = p_1^{a_1} p_2^{a_2} \cdots p_s^{a_s},$$

where $p_1, \ldots, p_s$ are distinct primes and each $a_i > 0$. The necessary and sufficient
conditions that the only groups of order n be nilpotent are: no $p_i$, $i = 1, \ldots, s$,
shall divide any

$$p_j^{a_j} - 1, \; p_j^{a_j - 1} - 1, \; \ldots, \; p_j - 1, \qquad j = 1, \ldots, s.$$
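The arithmetic condition of Theorem 1 is easy to test for a given n. The following is a minimal sketch, assuming the condition is read as "no prime divisor $p_i$ of n divides $p_j^k - 1$ for any prime divisor $p_j$ and any $1 \le k \le a_j$"; the function names are hypothetical and chosen for this illustration only. The classical criterion $(n, \phi(n)) = 1$ quoted in the introduction is included for comparison.

```python
from math import gcd


def prime_factorization(n):
    """Return {prime: exponent} for n >= 1 by trial division."""
    factors = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def euler_phi(n):
    """Euler's phi-function computed from the prime divisors of n."""
    result = n
    for p in prime_factorization(n):
        result -= result // p
    return result


def only_cyclic_groups(n):
    """Criterion quoted in the introduction: (n, phi(n)) = 1."""
    return gcd(n, euler_phi(n)) == 1


def only_nilpotent_groups(n):
    """Condition of Theorem 1 as read above:
    no p_i divides p_j**k - 1 for 1 <= k <= a_j."""
    factors = prime_factorization(n)
    for p_j, a_j in factors.items():
        for k in range(1, a_j + 1):
            if any((p_j ** k - 1) % p_i == 0 for p_i in factors):
                return False
    return True
```

For instance, n = 45 passes the nilpotency test but not the cyclic one, while n = 75 fails because 3 divides $5^2 - 1$.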


We introduce the following notations: the centre of a group A by Z(A), 
the group of all automorphisms of A by (A), the group of all inner auto- 
morphisms of A by ’(A), the factor group @(A)|#’(A) by (A), the direct 


Received November 17, 1958. The author wishes to express his appreciation to Professor 
H. N. Shapiro for suggesting this problem to him. 


product of the two groups A and B by A × B, and the direct product of the
n groups $A_1, \ldots, A_n$ by

$$\underset{i=1}{\overset{n}{\times}} A_i,$$

the order of the finite group A by |A|, and a Sylow p-group of a group G by $S_p$.

Let $B$ be a group such that each element $\alpha \in B$ is associated with an
automorphism $a \to a^{\alpha}$ of $A$. Let $G$ be an extension of $A$ by $B$, that is, $A$ is a normal
subgroup of $G$ and $G/A \cong B$. Then the elements of $G$ can be written as $g_\alpha a$
where the $g_\alpha$ are in one-to-one correspondence with the $\alpha \in B$, and $a \in A$;
also

$$g_\alpha a \cdot g_\beta b = g_{\alpha\beta} f(\alpha, \beta)\, a^{\beta} b,$$

where $f(\alpha, \beta)$ is a factor system. Moreover,

$$g_\alpha^{-1} a\, g_\alpha = a^{\alpha},$$

and

$$(a^{\alpha})^{\beta} = f(\alpha, \beta)^{-1}\, a^{\alpha\beta} f(\alpha, \beta).$$

Finally, to the extension $G$ there corresponds a well-defined homomorphism
$\theta$ of $B$ into $\overline{\mathfrak{A}}(A)$ (3, pp. 121-126). If $N$ is a normal subgroup of $G$ whose
order is prime to its index, then $G$ splits over $N$ (Schur's theorem) (4, p. 132).

In general, if $A$ is abelian, then $\overline{\mathfrak{A}}(A) = \mathfrak{A}(A)$ and $B$ is a group of operators
for $A$, that is, $A$ is a $B$-module. It is well known that the second cohomology
group $H^2(B, A)$ is the group of all group extensions of $A$ by $B$. If $A$ and $B$
are finite and $(|B|, |A|) = 1$, then $H^r(B, A) = 0$ for all $r > 0$ (1, p. 237), in
particular, $H^2(B, A) = 0$, so the only extensions of $A$ by $B$ are splitting extensions,
that is, we can take $f(\alpha, \beta) = 1$, and, therefore, $G$ contains a subgroup
$B' \cong B$ such that $A \cap B' = e$, the identity element, and $G = AB'$.

The consideration of non-abelian groups $A$ is reduced to the abelian case
by the following theorem: There exists a one-to-one correspondence between
all non-equivalent extensions of $A$ by $B$ associated with $\theta$ and all
non-equivalent extensions of $Z(A)$ by the group of operators $B$ corresponding to the
homomorphism $\theta$ (3, pp. 142-145).

In the case of an abelian group $A$, the non-equivalent splitting extensions
of $A$ by $B$ are in one-to-one correspondence with the distinct homomorphisms
of $B$ into $\mathfrak{A}(A)$ (3, p. 149). The kernel of the homomorphism will be denoted
by $W$. If $W = B$, then we say that $B$ acts trivially on $A$.


2. Proof of Theorem 1. (1) Sufficiency: To proceed by induction, we
assume that the statement is true for every

$$n' = p_1^{\beta_1} \cdots p_s^{\beta_s}, \qquad \beta_i \le a_i, \quad n' < n.$$

Now since

$$(|G|, (p_i^{a_i} - 1)(p_i^{a_i - 1} - 1) \cdots (p_i - 1)) = 1,$$













we have, by Frobenius' Theorem (4, p. 143), that the maximal $p_i$-factor group
of $G$ is isomorphic to every Sylow $p_i$-group of $G$, that is, $G$ contains a normal
subgroup $N$ such that $G/N \cong S_{p_i}$, and $G$ is a splitting extension of $N$. But
there exists a one-to-one correspondence between all non-equivalent extensions
of $N$ by $S_{p_i}$ associated with $\theta$ and all non-equivalent extensions of $Z(N)$ by
the group of operators $S_{p_i}$ corresponding to the homomorphism $\theta$. Thus we
must consider splitting extensions, $H$, of the $S_{p_i}$-module $Z(N)$. By the induction
hypothesis,

$$N = \mathop{\times}\limits_{j \ne i} S_{p_j}, \quad \text{so} \quad Z(N) = \mathop{\times}\limits_{j \ne i} Z(S_{p_j}).$$


$Z(S_{p_j})$ is an abelian group of order $p_j^{\gamma_j}$, $1 \le \gamma_j \le a_j$, from which it follows
(4, p. 112) that $|\mathfrak{A}(Z(N))|$ is a divisor of

$$\prod_{j \ne i} (p_j^{\gamma_j} - 1)(p_j^{\gamma_j} - p_j) \cdots (p_j^{\gamma_j} - p_j^{\gamma_j - 1}),$$

whence it is clear that we can only take $W = S_{p_i}$, which means trivial action,
that is, the only extension of $Z(N)$ by $S_{p_i}$ is $S_{p_i} \times Z(N)$. Therefore, by the
one-to-one correspondence there is only one extension of $N$ by $S_{p_i}$ associated
with a given homomorphism $\theta$. Thus, the non-equivalent extensions of $N$ by
$S_{p_i}$ are in one-to-one correspondence with those homomorphisms of $S_{p_i}$ into
$\overline{\mathfrak{A}}(N)$ which are associated with extensions of $N$ by $S_{p_i}$. But
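$$\mathfrak{A}(N) = \mathop{\times}\limits_{j \ne i} \mathfrak{A}(S_{p_j}) \quad \text{and} \quad |\mathfrak{A}(N)|$$

is a divisor of

$$\prod_{j \ne i} (p_j^{a_j} - 1)(p_j^{a_j} - p_j) \cdots (p_j^{a_j} - p_j^{a_j - 1}).$$

Hence $|\overline{\mathfrak{A}}(N)|$ is also a divisor of this number. Therefore, it is clear that the
only possible homomorphism is the trivial one which implies that the only
extension of $N$ by $S_{p_i}$ is $S_{p_i} \times N$. But

$$N = \mathop{\times}\limits_{j \ne i} S_{p_j};$$

hence

$$G = \mathop{\times}\limits_{j=1}^{s} S_{p_j},$$

and $G$ is nilpotent.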
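The divisibility step in the sufficiency argument rests on the products of the form (p^a - 1)(p^a - p)...(p^a - p^(a-1)), which is the order of the automorphism group of the elementary abelian group of order p^a. Since p^a - p^k = p^k (p^(a-k) - 1), a prime different from p divides the product exactly when it divides some p^j - 1 with 1 <= j <= a. A small sketch, with the hypothetical helper names `gl_order` and `acts_only_trivially`, makes this concrete:

```python
from math import gcd


def gl_order(p, a):
    """(p^a - 1)(p^a - p)...(p^a - p^(a-1)), the order of GL(a, p)."""
    order = 1
    for k in range(a):
        order *= p ** a - p ** k
    return order


def acts_only_trivially(p_i, p_j, a_j):
    """True when p_i is prime to the product above for (p_j, a_j), so that
    a p_i-group has no non-trivial homomorphism into a group whose order
    divides that product."""
    return gcd(p_i, gl_order(p_j, a_j)) == 1
```

For n = 45 the product for (3, 2) is 48, which is prime to 5, in agreement with Theorem 1; for n = 56 the product for (2, 3) is 168, which is divisible by 7.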
(2) Necessity: Suppose some

$$p_i \mid (p_j^{a_j} - 1) \cdots (p_j - 1).$$

Then we consider the following arrangement. Let

$$A(a_j, p_j) = C_{p_j} \times C_{p_j} \times \cdots \times C_{p_j} \quad (a_j \text{ times})$$







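where $C_{p_j}$ is the cyclic group of order $p_j$. We denote by $G(a_i, p_i)$ a group of
order $p_i^{a_i}$. Now, clearly,

$$\mathfrak{A}\Bigl(\mathop{\times}\limits_{j \ne i} A(a_j, p_j)\Bigr) = \mathop{\times}\limits_{j \ne i} \mathfrak{A}(A(a_j, p_j)).$$

Since, by assumption,

$$\mathop{\times}\limits_{j \ne i} \mathfrak{A}(A(a_j, p_j))$$

contains a subgroup of order $p_i$, there exists a homomorphism:

$$G(a_i, p_i) \to \mathfrak{A}\Bigl(\mathop{\times}\limits_{j \ne i} A(a_j, p_j)\Bigr)$$

with kernel a normal subgroup $W$ of order $p_i^{a_i - 1}$. Associated with this
homomorphism, there is a splitting extension $G$ of

$$\mathop{\times}\limits_{j \ne i} A(a_j, p_j) \quad \text{by} \quad G(a_i, p_i)$$

for which $W$ is the normal subgroup of $G(a_i, p_i)$ which acts trivially on

$$\mathop{\times}\limits_{j \ne i} A(a_j, p_j).$$

$G$ is, of course, a group of order

$$n = \prod_{i=1}^{s} p_i^{a_i},$$

but the extension $G$ is not equivalent to

$$G(a_i, p_i) \times \Bigl(\mathop{\times}\limits_{j \ne i} A(a_j, p_j)\Bigr).$$

In fact, $G$ is not isomorphic to this group for $S_{p_i} = G(a_i, p_i)$ is not normal
in $G$. Namely, if $a^{-1} g_\alpha a = g_\beta$ then $g_\alpha^{-1} a^{-1} g_\alpha a = g_\alpha^{-1} g_\beta$, that is, $(a^{-1})^{\alpha} a = g_\alpha^{-1} g_\beta$,
but the left side belongs to

$$\mathop{\times}\limits_{j \ne i} A(a_j, p_j)$$

while the right side belongs to $S_{p_i}$. However,

$$S_{p_i} \cap \Bigl(\mathop{\times}\limits_{j \ne i} A(a_j, p_j)\Bigr) = e,$$

so $g_\beta = g_\alpha$. Now if $S_{p_i}$ were normal in $G$, then we would have $a^{-1} g_\alpha a = g_\alpha$ for
all $g_\alpha \in S_{p_i}$ and all

$$a \in \mathop{\times}\limits_{j \ne i} A(a_j, p_j);$$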













that is $(a^{-1})^{\alpha} = a^{-1}$ for all $\alpha$ and all $a$, which implies that $S_{p_i}$ acts trivially on

$$\mathop{\times}\limits_{j \ne i} A(a_j, p_j), \quad \text{or} \quad W = S_{p_i},$$

which is a contradiction.
Therefore, $S_{p_i}$ is not normal in $G$; hence $G$, of order $n$, is not nilpotent.
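The smallest odd instance of this construction is n = 21, where 3 divides 7 - 1. The sketch below builds the splitting extension of the cyclic group of order 7 by the cyclic group of order 3 acting through multiplication by 2 (an element of order 3 modulo 7) and counts the elements of order 3. The encoding of elements as pairs and the names `mul` and `order` are assumptions made for this illustration.

```python
from itertools import product

P, Q, U = 7, 3, 2  # U = 2 has multiplicative order 3 modulo 7


def mul(x, y):
    """(a, s)(b, t) = (a + U^s * b, s + t): a semidirect product Z_7 by Z_3."""
    (a, s), (b, t) = x, y
    return ((a + pow(U, s, P) * b) % P, (s + t) % Q)


def order(x):
    """Order of x, found by repeated multiplication."""
    identity, n, y = (0, 0), 1, x
    while y != identity:
        y = mul(y, x)
        n += 1
    return n


elements = list(product(range(P), range(Q)))
elements_of_order_3 = [x for x in elements if order(x) == 3]
```

A normal Sylow 3-subgroup would contribute exactly two elements of order 3; here there are fourteen, so the Sylow 3-subgroup is not normal and this group of order 21 is not nilpotent.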


COROLLARY 1. Let $G(n)$ be the number of groups of order $n$. If

$$n = \prod_{i=1}^{s} p_i^{a_i},$$

then the necessary and sufficient conditions in order that

$$G(n) = \prod_{i=1}^{s} G(p_i^{a_i})$$

are that no $p_i$, $i = 1, 2, \ldots, s$, divides any

$$(p_j^{a_j} - 1)(p_j^{a_j - 1} - 1) \cdots (p_j - 1).$$

Proof. There are $G(p_i^{a_i})$ groups of order $p_i^{a_i}$. By taking all possible direct
products it is clear that

$$G(n) \ge \prod_{i=1}^{s} G(p_i^{a_i}),$$

and we have equality, if and only if the only groups of order $n$ are direct
products of their Sylow subgroups.
It is clear that to have only abelian groups of order $n$, we must have $a_i \le 2$
for all $i$; hence we get Dickson's theorem as a special case of Theorem 1.


REFERENCES 


1. H. Cartan and S. Eilenberg, Homological algebra (Princeton, 1956).
2. L. E. Dickson, Trans. Amer. Math. Soc. 6 (1905), 201.
3. A. G. Kurosh, The theory of groups, vol. II (New York, 1956).
4. H. Zassenhaus, The theory of groups (New York, 1949).


Rutgers University 










FINITE GROUPS WHICH ADMIT AN AUTOMORPHISM 
WITH FEW ORBITS 


DANIEL GORENSTEIN 


1. Introduction. In the course of investigating the structure of finite
groups which have a representation in the form $ABA$, for suitable subgroups
$A$ and $B$, we have been forced to study groups $G$ which admit an
automorphism $\phi$ such that every element of $G$ lies in at least one of the orbits under $\phi$
of the elements $g$, $g\phi^r(g)$, $g\phi^r(g)\phi^{2r}(g)$, $g\phi^r(g)\phi^{2r}(g)\phi^{3r}(g)$, etc., where $g$ is a
fixed element of $G$ and $r$ is a fixed integer.

In a previous paper on $ABA$-groups written jointly with I. N. Herstein
(4), we have treated the special case $r = 0$ (in which case every element
of $G$ can be expressed in the form $\phi^i(g^j)$), and have shown that if the orders
of $\phi$ and $g$ are relatively prime, then $G$ is either Abelian or the direct product
of an Abelian group of odd order and the quaternion group of order 8. In
another paper (3), the author has shown that if each element of $G$ lies in
exactly one of these orbits, then $G$ must be an elementary Abelian group of
type $(p, p, \ldots, p)$. The purpose of this paper is to prove more generally
that any finite group $G$ which admits an automorphism whose orbits are of
the above form is necessarily solvable (Theorem 5). The burden of the proof
rests on the case in which $\phi$ leaves only the identity element of $G$ fixed, and
in this case we shall show that $G$ is in fact nilpotent (Theorem 4).

In the course of the proof we first establish the nilpotency of $G$ in the
so-called non-exceptional case (in particular, if $G$ is solvable) (Theorem 1).
For this case our statement and argument resemble a result of Feit (2) and
Higman (5), which asserts that a solvable group having an automorphism
of prime order which leaves only the identity element fixed is necessarily
nilpotent.* Their argument actually applies if $G$ is assumed to be $p$-normal
for all $p \mid o(G)$. Recently it has been announced by J. G. Thompson that $G$
must in fact be $p$-normal for all $p \mid o(G)$ whenever $G$ admits an automorphism
of prime order leaving only the identity element of $G$ fixed, from which it
follows that an arbitrary group $G$ admitting such an automorphism is
necessarily nilpotent.†

However, not much is known concerning the structure of $G$ if $\phi$ is of
composite order. It is not difficult to construct a solvable non-nilpotent group $G$
admitting an automorphism $\phi$ of composite order leaving only the identity


Received October 22, 1958. 

*Feit proves the nilpotency of G under the weaker hypothesis that no subgroup of G has 
an exceptional group as a composition factor. 

†A.M.S. Notices, 5 (6) (November, 1958), 695.




element of $G$ fixed; and it is an open question whether $G$ must be solvable
to admit such an automorphism even when $\phi$ has order 4.

We see then that our assumption on the orbits of $G$ is a strong one since
no other conditions on $G$ or the order of $\phi$ are needed to prove that $G$ is
nilpotent if $\phi$ leaves only the identity element of $G$ fixed. A direct consequence
of this assumption is a simple inequality (Lemma 2.3) which exists between
the order of $\phi$ and the order of $G$; and it is this inequality which lies at the
heart of many of our arguments.

In §§ 10 and 11 we shall determine the structure of groups of prime power
order which admit an automorphism $\phi$ without non-trivial fixed elements
satisfying our special condition on orbits, and shall show that such a group
is either Abelian or of class 2 (Theorem 8). Combining this result with Theorem
4, it will follow that any group $G$ admitting such an automorphism $\phi$ without
non-trivial fixed elements is either Abelian or nilpotent of class 2 (Theorem 9).

In the final section we shall determine the precise connection between 
groups whose orbits satisfy this condition and groups of the form ABA. As 
an application we shall prove the solvability of a certain class of ABA-groups 
(Theorem 10). 

The author wishes to thank Prof. Herstein for his considerable help in the 
preparation of this paper, particularly with the proof of Lemma 3.1. 


2. $\phi$-groups. We shall call a group $G$ a $\phi$-group if $G$ admits an
automorphism $\phi$ such that every element of $G$ can be expressed in the form
$\phi^i(g\phi^r(g) \cdots \phi^{r(j-1)}(g))$ for some fixed integer $r$ and some fixed element $g$ in $G$, $i$ and $j$
being arbitrary. The element $g$ will be called a generator of $G$ under $\phi$, and
$r$ will be called the index of $G$ with respect to $g$, or simply the index of $G$.

For simplicity we exclude the trivial case in which the order $h$ of $\phi$ is 1.
This implies, in particular, that $o(G) > 1$. We may further assume that $r \mid h$
for otherwise set $r_1 = (r, h)$ and define $\phi_1 = \phi^{r/r_1}$. Then clearly $\phi_1$ has order
$h$, $G$ is a $\phi_1$-group of index $r_1$ with respect to $g$, and we have $r_1 \mid h$. In the special
case in which $h = r$, and hence in which every element of $G$ can be expressed
in the form $\phi^i(g^j)$, we shall say that $G$ is a $\phi$-group of index 0.

We can imbed $G$ as a normal subgroup of a group $G^*$, which contains an
element $a$ of order $h$ such that $aga^{-1} = \phi(g)$ for all $g$ in $G$ and such that
$G^* = GA$, where $A$ denotes the subgroup generated by $a$. If $\phi$ is of prime
order and leaves only the identity element of $G$ fixed, it is easy to show that
$G^*$ is a Frobenius group and that $G$ is the regular subgroup of $G^*$. By analogy
with this case, we shall say, whenever $\phi$ leaves only the identity element of
$G$ fixed, that $G$ is a regular $\phi$-group, and that $\phi$ is a Frobenius automorphism
of $G$.

For brevity we also introduce the symbol $[g]_r^j$ for the element $g\phi^r(g) \cdots
\phi^{r(j-1)}(g)$. For completeness we set $[g]_r^0 = 1$. This symbol has several formal
properties which we shall use repeatedly throughout the ensuing discussion,
and which for convenience we incorporate into the following lemma:







LEMMA 2.1. Let $\phi$ be an automorphism of order $h$ of a group $G$. For any $g$ in
$G$ and any integers $i, j, k, r$, we have $[[g]_r^i]_{ri}^j = [g]_r^{ij}$ and $[g]_r^{j+k} = [g]_r^j \phi^{rj}([g]_r^k)$.
Furthermore, if $h \mid r$, $[g]_r^j = g^j$; while if $r \mid h$ and $\phi^r$ is Frobenius, $[g]_r^{h/r} = 1$.


Proof. All these relations except the last follow immediately from the
definition of the symbol $[g]_r^j$. On the other hand, if $\phi^r$ leaves only the identity
element of $G$ fixed, it is easy to see that $g$ can be written in the form $x^{-1}\phi^r(x)$
for some $x$ in $G$. But then

$$[g]_r^{h/r} = (x^{-1}\phi^r(x))(\phi^r(x^{-1})\phi^{2r}(x)) \cdots (\phi^{h-r}(x^{-1})\phi^h(x)) = x^{-1}\phi^h(x) = 1.$$
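The identities of Lemma 2.1 can be spot-checked in a small example. The sketch below takes G to be the additive group of integers modulo 7 and the automorphism to be multiplication by 2, which has order h = 3 and fixes only 0; this choice of group, and the names `phi_pow` and `bracket`, are assumptions made purely for illustration. In additive notation the symbol becomes a sum.

```python
N, U, H = 7, 2, 3  # G = Z_7 (additive), phi(x) = 2x, of order h = 3


def phi_pow(x, e):
    """Apply phi^e to x, for an exponent e >= 0."""
    return (pow(U, e, N) * x) % N


def bracket(g, r, j):
    """[g]_r^j = g + phi^r(g) + ... + phi^{r(j-1)}(g), written additively."""
    total = 0
    for t in range(j):
        total = (total + phi_pow(g, r * t)) % N
    return total


def check_identities(g, r, i, j, k):
    """The two formal identities of the lemma for one choice of parameters."""
    first = bracket(bracket(g, r, i), r * i, j) == bracket(g, r, i * j)
    second = bracket(g, r, j + k) == (
        bracket(g, r, j) + phi_pow(bracket(g, r, k), r * j)
    ) % N
    return first and second
```

With r = 1 the automorphism is Frobenius and `bracket(g, 1, 3)` vanishes for every g, while with r = 3 (a multiple of h) `bracket(g, 3, j)` reduces to j times g.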

The following lemma shows that the property of being a $\phi$-group carries
over to subgroups and factor groups of $G$.


LEMMA 2.2. Let $G$ be a $\phi$-group of index $r$ with respect to the generator $g$, and
let $H$ be a subgroup of $G$ invariant under $\phi$. Then $H$ is a $\phi$-group of index $rs$
with respect to the generator $[g]_r^s$ for some integer $s$. If $H$ is normal in $G$ and
$\bar{G} = G/H$ then $\bar{G}$ is a $\bar{\phi}$-group of index $r$ with respect to the generator $\bar{g}$, where
$\bar{\phi}, \bar{g}$ denote respectively the image of $\phi$ on $\bar{G}$ and the residue of $g$ in $\bar{G}$. Furthermore,
no proper subgroup of $G$ invariant under $\phi$ contains $g$.


Proof. The last two statements of the lemma follow at once from the
definition of a $\phi$-group. To prove the first assertion, let $s$ be the least positive integer
such that $g_1 = [g]_r^s$ is in $H$. Since $H$ is invariant under $\phi$, every element of $G$
of the form $\phi^i([g_1]_{rs}^j)$ is in $H$. Conversely, if $[g]_r^k \in H$, write $k = sj + t$ with
$0 \le t < s$ and use Lemma 2.1 to get

$$[g]_r^k = [g]_r^{sj} \phi^{rsj}([g]_r^t) = [g_1]_{rs}^j \phi^{rsj}([g]_r^t),$$

whence $[g]_r^t \in H$. Since $s$ is the least positive integer with this property,
$t = 0$, and it follows that every element of $H$ is of the form $\phi^i([g_1]_{rs}^j)$. Thus $H$
is a $\phi$-group of index $rs$ with generator $[g]_r^s$.

Finally, we shall establish a simple, but extremely important, relation
between the order of $\phi$ and the order of $G$.


LEMMA 2.3. Let $G$ be a $\phi$-group of index $r$ with respect to a generator $g$ of $G$;
let $h$ be the order of $\phi$ and let $k$ be the least positive integer such that $[g]_r^k = 1$. Then
$hk > o(G)$. In particular, if $\phi^r$ is Frobenius, $k = h/r$.


Proof. Since every element of $G$ must be in the orbit under $\phi$ of one of the
$k$ elements $[g]_r^j$, $j = 1, 2, \ldots, k$, since each of these orbits contains at most
$h$ elements, and since the last one of them consists of only the identity element
of $G$, the inequality $hk > o(G)$ is immediate.

If $\phi^r$ is Frobenius, Lemma 2.1 shows that $[g]_r^{h/r} = 1$. The proof of this
equality shows also that for any value of $j < h/r$, $[g]_r^j \ne 1$. Thus $k = h/r$.
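A small numerical instance of Lemma 2.3 may help fix the quantities h and k. The sketch below works in the additive group of integers modulo n with the automorphism x -> u·x; the function name `phi_group_data` and the particular parameters (n = 13, u = 2) are assumptions chosen for illustration, not examples taken from the paper.

```python
def phi_group_data(n, u, r, g=1):
    """For G = Z_n (additive) and phi(x) = u*x with gcd(u, n) = 1, n > 1,
    return (h, k, covered): the order h of phi, the least positive k with
    [g]_r^k = 0, and the set of all elements phi^i([g]_r^j), 1 <= j <= k."""
    h = 1
    while pow(u, h, n) != 1:
        h += 1
    partial, k = g % n, 1          # partial holds [g]_r^k
    brackets = [partial]
    while partial != 0:
        partial = (partial + pow(u, r * k, n) * g) % n
        k += 1
        brackets.append(partial)
    covered = {(pow(u, i, n) * b) % n for i in range(h) for b in brackets}
    return h, k, covered


h, k, covered = phi_group_data(13, 2, 2)
```

Here multiplication by 2 has order h = 12 modulo 13, its square fixes only 0, and the computation gives k = 6 = h/r with every residue covered, so hk = 72 exceeds o(G) = 13 as the lemma requires.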


3. Automorphisms of a class of groups of order $p^m q^n$. In the next
three sections we shall show that a regular $\phi$-group in which no subgroup
has an exceptional group as a composition factor is nilpotent. The heart of













the problem is to prove this result for certain $\phi$-groups of order $p^m q^n$; §§ 3
and 4 are devoted to this special case.


LEMMA 3.1. Let $G$ be a group of order $p^m q^n$, $p$ and $q$ being primes, in which
the $p$-Sylow subgroup $P$ is normal in $G$ and Abelian of type $(p, p, \ldots, p)$,
while the $q$-Sylow subgroups are Abelian of type $(q, q, \ldots, q)$; and assume that
the centre of $G$ is trivial. Suppose $\phi$ is an automorphism of $G$ of order $h$ such
that no proper subgroup of $P$ which is invariant under $\phi$ is normal in $G$ and
such that some $q$-Sylow subgroup $Q$, but no proper subgroup of $Q$, is invariant
under $\phi$. Then if $d$ is the order of $\phi$ on $Q$, we have $d \mid m$ and $h \mid d(p^{m/d} - 1)$.


Proof. Since $G$ has no centre, $p \ne q$ and $m, n > 0$.

Since each element $y$ in $Q$ induces by conjugation an automorphism $\psi_y$
of $P$, there exists a group of automorphisms $A$ acting on $P$ which can be
expressed in the form $\bar{Q}R$, where $\bar{Q}$ is normal in $A$ and is isomorphic to $Q$
under the correspondence $\psi_y \leftrightarrow y$, where $R$ is the cyclic subgroup generated
by $\phi$, and where

$$(1) \qquad \phi^{-1}\psi_y\phi = \psi_{\phi(y)} \quad \text{for all } y \text{ in } Q.$$

For all $y$ in $Q$, we have $\phi^d(y) = y$, and hence $\phi^{-d}\psi_y\phi^d = \psi_y$. Thus $\phi^d$ is in
the centre of $A$, and since $\bar{Q}$ is Abelian, the subgroup $A_0$ generated by $\phi^d$ and
$\bar{Q}$ is Abelian.

We shall regard $P$ as an $m$-dimensional vector space over the prime field
$K$ with $p$ elements, and $A$ as a group of linear transformations acting on $P$.
If $K^*$ denotes the algebraic closure of $K$ and $P^*$ the $m$-dimensional vector
space over $K^*$, we may also consider $A$ as a group of linear transformations
on $P^*$.

Now let $W$ be a minimal subspace of $P$, invariant under $\phi^d$, and of dimension
$t$, and let $f(x)$, of degree $t$ and irreducible over $K$, be the minimal polynomial
of $\phi^d$ on $W$. Since $\phi^d$ is in the centre of $A$, the subspaces $\phi^i\psi_y(W)$ are invariant
under $\phi^d$ for all $i$ and all $y$ in $Q$, and $\phi^d$ has the same minimal polynomial
$f(x)$ on each of these subspaces. Let

$$P_0 = \sum_{i, y} \phi^i\psi_y(W).$$


It follows immediately from (1) that $P_0$ is left fixed by every element of $A$.
Regarded as a subgroup of $P$, $P_0$ is thus normal in $G$ and invariant under $\phi$,
whence by our hypotheses $P_0 = P$. Since now $P$ is the sum of minimal
subspaces invariant under $\phi^d$, it follows that $P$ is the direct sum of subspaces
$W_1, W_2, \ldots, W_s$, each of dimension $t$, each invariant under $\phi^d$, and on each
of which the minimal polynomial of $\phi^d$ is $f(x)$. Thus

$$(2) \qquad m = st \text{ and } f(x)^s \text{ is the characteristic polynomial of } \phi^d \text{ on } P.$$

The order $w$ of $\phi^d$ on $P$ is the same as its order on each of the subspaces $W_i$,
and since $f(x)$ is irreducible, it follows that $w \mid p^t - 1$. In particular, this
















implies $(w, p) = 1$, and hence that the order of $A_0$ is relatively prime to $p$. It
follows that the representation of $A_0$ in $P^*$ is completely reducible.

Now $A_0$ is Abelian and $P^*$ has coefficients in an algebraically closed field;
hence we can find a vector $x_1 \ne 0$ in $P^*$ which is a common characteristic
vector of every element in $A_0$. We shall show that for $0 \le i < d$ the vectors
$\phi^i(x_1)$ are also common characteristic vectors of $A_0$ and that they generate
a $d$-dimensional subspace of $P^*$, invariant under $A$.

For each $y$ in $Q$, we have

$$(3) \qquad \psi_{\phi^i(y)}(x_1) = a_{iy}x_1$$

for some element $a_{iy}$ in $K^*$. Thus $\phi^{-i}\psi_y\phi^i(x_1) = \psi_{\phi^i(y)}(x_1) = a_{iy}x_1$, so that
$\psi_y(\phi^i(x_1)) = a_{iy}\phi^i(x_1)$, proving that $\phi^i(x_1)$ is a common characteristic vector
of the elements of $\bar{Q}$. Since $\phi^d$ and $\phi^i$ commute, $\phi^i(x_1)$ is also a characteristic
vector of $\phi^d$, and hence of every element of $A_0$.

Let $P^*_1$ be the subspace of $P^*$ generated by the vectors $\phi^i(x_1)$. Since
$\phi^d(x_1) = b_1x_1$ for some $b_1$ in $K^*$, $P^*_1$ is invariant under $A$; furthermore, the
vectors $x_1, \phi(x_1), \ldots, \phi^d(x_1)$ are linearly dependent and hence $\dim P^*_1 \le d$.

Suppose if possible that $\dim P^*_1 = k < d$. Then for $0 \le i < k$ the vectors
$\phi^i(x_1)$ are linearly independent, and furthermore

$$(4) \qquad \phi^k(x_1) = c_0x_1 + c_1\phi(x_1) + \cdots + c_{k-1}\phi^{k-1}(x_1), \quad c_i \in K^*,$$

and $c_0 \ne 0$. Apply $\psi_y$ to (4) and use (3) to obtain

$$(5) \qquad a_{ky}\phi^k(x_1) = c_0a_{0y}x_1 + c_1a_{1y}\phi(x_1) + \cdots + c_{k-1}a_{k-1,y}\phi^{k-1}(x_1).$$

Now multiply (4) by $a_{ky}$ and subtract from (5), obtaining

$$c_0(a_{0y} - a_{ky})x_1 + c_1(a_{1y} - a_{ky})\phi(x_1) + \cdots + c_{k-1}(a_{k-1,y} - a_{ky})\phi^{k-1}(x_1) = 0.$$

Since $x_1, \phi(x_1), \ldots, \phi^{k-1}(x_1)$ are linearly independent and since $c_0 \ne 0$, we
conclude that

$$(6) \qquad a_{ky} = a_{0y}.$$
But (6) implies

$$\phi^{-k}\psi_y^{-1}\phi^k\psi_y(x_1) = a_{0y}\phi^{-k}\psi_y^{-1}\phi^k(x_1) = a_{0y}a_{ky}^{-1}x_1 = x_1.$$

Thus $x_1$ is a common characteristic vector of all commutators $\phi^{-k}\psi_y^{-1}\phi^k\psi_y$,
$y \in Q$, with the common characteristic root 1. Since these linear
transformations are defined over $K$ and 1 is in $K$, it is easy to show that they have a
common characteristic vector $z_1 \ne 0$ in $P$ with a common characteristic
root 1. But then

$$\phi^{-k}\psi_y^{-1}\phi^k\psi_y(z_1) = \phi^{-k}(y(\phi^k(y^{-1}z_1y))y^{-1}) = z_1,$$

and it follows that $\phi^{-k}(y)y^{-1}$ is in the centralizer of $\phi^k(z_1)$ for all $y$ in $Q$. But
$Q$ is Abelian and hence the set of elements $\phi^{-k}(y)y^{-1}$ form a subgroup $Q_0$ of
$Q$, which is clearly invariant under $\phi$. Since $k < d$ and $d$ is the order of $\phi$













on $Q$, $Q_0 \ne 1$ and our hypotheses imply that $Q_0 = Q$. Thus $\phi^k(z_1)$ commutes
elementwise with $Q$, and since $P$ is Abelian, lies in the centre of $G$, contrary
to the fact that $G$ has a trivial centre. Thus $\dim P^*_1 = d$, as asserted, and
with respect to this basis, $\phi$ is represented on $P^*_1$ by the companion matrix

$$(7) \qquad \Phi_1 = \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ b_1 & 0 & 0 & \cdots & 0 \end{pmatrix}.$$


Since $A_0$ is completely reducible and leaves $P^*_1$ invariant, we can write
$P^* = P^*_1 \oplus P'$, where $P'$ is invariant under $A_0$. If $P' \ne 0$, we can construct
as above a $d$-dimensional subspace $P^*_2 \subset P'$, invariant under $A$, and with
respect to a suitable basis of $P^*_2$, $\phi$ will be represented by a companion matrix
$\Phi_2$, of the same form as $\Phi_1$, with possibly a different element $b_2$ in the $d$th
row, 1st column. Continuing this process, we can represent $P^*$ as the direct
sum of subspaces $P^*_1, P^*_2, \ldots, P^*_v$, each invariant under $A$ and of dimension
$d$, and with respect to a suitable basis of $P^*$, $\phi$ is represented by the matrix

$$(8) \qquad \Phi = \begin{pmatrix} \Phi_1 & & & \\ & \Phi_2 & & \\ & & \ddots & \\ & & & \Phi_v \end{pmatrix},$$

where each $\Phi_i$ is a companion matrix of the form (7), having some element
$b_i$ of $K^*$ in its $d$th row, 1st column. In particular,


$$(9) \qquad m = dv.$$

From (8) we see that the characteristic polynomial of $\phi$ over $P^*$ is
$g(x) = (x^d - b_1)(x^d - b_2) \cdots (x^d - b_v)$ and that the characteristic
polynomial of $\phi^d$ is $h(x)^d = [(x - b_1)(x - b_2) \cdots (x - b_v)]^d$. Since $\phi$ is defined
over $P$, the coefficients of $g(x)$ and hence of $h(x)$ are in $K$. A comparison with
(2) now yields

$$(10) \qquad f(x)^s = h(x)^d.$$

But $f(x)$ is irreducible, and hence $d \mid s$ and $h(x) = f(x)^{s/d}$. It follows that
the roots $b_1, \ldots, b_v$ of $h(x)$ are roots of $f(x)$ and hence lie in the field with $p^t$
elements. Since $ts = m$ and $d \mid s$ the quantities $b_i$ lie in the field with $p^{m/d}$
elements, and hence have orders dividing $p^{m/d} - 1$. But by (8) $\phi^d$ is a diagonal
matrix with $b_1, b_2, \ldots, b_v$ as diagonal entries, and it follows that the order
of $\phi^d$ divides $p^{m/d} - 1$, which completes the proof of the lemma.
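The final step of the proof uses the fact that the d-th power of a companion matrix of the form (7) is the scalar matrix with the entry b on its diagonal. A short check over a prime field follows; the helper names `companion`, `mat_mul` and `mat_pow`, and the sample values d = 3, b = 5, p = 7, are assumptions made for this illustration only.

```python
def companion(d, b):
    """d x d matrix with 1s on the superdiagonal and b in row d, column 1."""
    m = [[0] * d for _ in range(d)]
    for i in range(d - 1):
        m[i][i + 1] = 1
    m[d - 1][0] = b
    return m


def mat_mul(x, y, p):
    """Product of two square matrices with entries reduced modulo p."""
    n = len(x)
    return [
        [sum(x[i][k] * y[k][j] for k in range(n)) % p for j in range(n)]
        for i in range(n)
    ]


def mat_pow(x, e, p):
    """x raised to the power e >= 0, modulo p, by repeated multiplication."""
    n = len(x)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(e):
        result = mat_mul(result, x, p)
    return result


cube = mat_pow(companion(3, 5), 3, 7)
```

The result `cube` is 5 times the identity matrix modulo 7, so the order of the cube of this matrix equals the multiplicative order of 5 in the field, as in the argument above.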


LEMMA 3.2. If $G$ satisfies the hypotheses of the preceding lemma, let $F$ denote
















the set of elements of $G$ left fixed by $\phi^r$, for some fixed integer $r$. Then either $F \subset P$,
$F = Q$, or $F = G$.


Proof. If $F \not\subset P$, there exists an element $z$ in $F$ with $z = xy$, $x$ in $P$, and
$y \ne 1$ in $Q$. We have $xy = z = \phi^r(z) = \phi^r(x)\phi^r(y)$, whence $\phi^r(x^{-1})x = \phi^r(y)y^{-1}$.
Since the left side of this equation is an element of $P$, while the right is an
element of $Q$, each is the identity, and so $\phi^r(y) = y$. Thus $y \in Q \cap F$, which
is invariant under $\phi$. But $Q \cap F \ne 1$ and it follows from the hypotheses of
Lemma 3.1 that $Q \cap F = Q$. Thus either $F \subset P$ or $Q \subset F$.

Suppose now that $F \supset Q$ with $F \ne Q$, whence $F \cap P \ne 1$. If $x \in F \cap P$,
$\phi^r(yxy^{-1}) = \phi^r(y)\phi^r(x)\phi^r(y^{-1}) = yxy^{-1}$, and hence $yxy^{-1}$ is in $F \cap P$ for any $y$ in $Q$.
Thus $F \cap P$ is normal in $G$, and being invariant under $\phi$, must equal $P$. Thus
$F$ contains $P$ as well as $Q$, and we conclude that $F = G$.


4. $\phi$-groups of order $p^m q^n$. We shall need a preliminary lemma.


LEMMA 4.1. Let $G$ be an Abelian $\phi$-group of index $r$, of order $p^m$ and of type
$(p, p, \ldots, p)$ and let $h$ be the order of $\phi$. Suppose $d \mid r$, $d \mid m$, and $h \mid d(p^{m/d} - 1)$.
Then either $d = 1$ or $d = 2$, $r \ne 0$, and the subgroup $F$ left elementwise fixed by
$\phi^r$ has order $p$.


Proof. Let $s = m/d$. Since $o(\phi^d) \mid p^s - 1$, $\phi^d$ is completely reducible when
considered as a linear transformation, and each of its irreducible constituents
has dimension $\le s$. Thus $G$ is the direct product of subgroups $G_1, G_2, \ldots, G_k$
invariant under $\phi^d$, each of order $\le p^s$ and $k \ge d$.

Let $g$ be a generator of $G$ under $\phi$ of index $r$ and write $g = g_1g_2 \cdots g_k$,
$g_i \in G_i$, $i = 1, 2, \ldots, k$. Since $G$ is Abelian, we have

$$(11) \qquad [g]_r^j = \prod_{i=1}^{k} [g_i]_r^j.$$

Since $\phi^d$ leaves $G_i$ invariant and since $d \mid r$, it follows that $[g_i]_r^j \in G_i$ for all $i, j$.
Suppose first that $r = 0$. Then $[g]_r^p = 1$ and we have $hp > o(G)$, whence
$d(p^s - 1) > p^{sd-1}$, which implies $d = 1$ or $d = 2$, $s = 1$, and $h = 2(p - 1)$.
On the other hand, if $r \ne 0$, the element

$$[g_i]_r^{p^s - 1}$$

has order 1 or $p$ and is invariant under $\phi^r$, whence by (11) the same is true of

$$[g]_r^{p^s - 1}.$$

It follows in either case that

$$[g]_r^{p(p^s - 1)} = 1$$

and hence that $h(p(p^s - 1)) > o(G)$. Thus

$$(12) \qquad d(p^s - 1)^2 > p^{sd-1}.$$

The only solutions of (12) are $d = 1$, $d = 2$, or $d = 3$, $s = 1$, and $h = 3(p - 1)$.
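The claim about inequality (12) can be explored by brute force over a small range of parameters. The sketch below assumes the reconstructed form d(p^s - 1)^2 > p^(sd-1) of (12); the function name `solutions_12` and the search bounds are illustrative choices, and a finite search of this kind is of course only a sanity check, not a proof.

```python
def solutions_12(primes, max_s, max_d):
    """All (d, s, p) in the given ranges with d * (p^s - 1)^2 > p^(s*d - 1)."""
    return [
        (d, s, p)
        for d in range(1, max_d + 1)
        for s in range(1, max_s + 1)
        for p in primes
        if d * (p ** s - 1) ** 2 > p ** (s * d - 1)
    ]


PRIMES = [2, 3, 5, 7, 11, 13]
found = solutions_12(PRIMES, max_s=4, max_d=6)

# Every solution in range with d >= 3 has d = 3 and s = 1.
large_d = [(d, s, p) for (d, s, p) in found if d >= 3]
```

In the same range the sharper inequality 3(p - 1)^2 > p^3, which arises when the third case is examined, has no solution at all.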










In the third case the $G_i$ are cyclic of order $p$, for $i = 1, 2, 3$ and are
permuted cyclically by $\phi$. But then if the subgroup $F$ left elementwise fixed
by $\phi^r$ were to contain some $G_i$, it would follow that $F = G$, whence $G$ would
be of index 0 which is not the case. It follows that $F = 1$ and hence that
$[g]_r^{p-1} = 1$. This leads, as in (12), to the inequality $3(p - 1)^2 > p^3$, which
is impossible.


We show next that $d = 2$, $h = 2(p^s - 1)$ is impossible. In this case
$G = G_1 \oplus G_2$ where $G_1, G_2$ are invariant under $\phi^r$, of order $p^s$, and are
permuted by $\phi$. If either $g_1 = 1$ or $g_2 = 1$, $\phi^i([g]_r^j) \in G_1 \cup G_2$, which is a proper
subset of $G$. Thus we must have $g = g_1g_2$ with $g_1 \ne 1$, $g_2 \ne 1$. But now $\phi^r$
has order $p^s - 1$ on both $G_1$ and $G_2$ and so $[g_2]_r^j = 1$ implies $[g_1]_r^j = 1$. Thus
the identity is the only element of $G_1$ which is of the form $\phi^i([g]_r^j)$, contrary
to the fact that $G$ is a $\phi$-group.

Suppose next that $d = 2$ and $h < 2(p^s - 1)$. Since $h \mid 2(p^s - 1)$, we conclude
that $h \le p^s - 1$. But now $[g]_r^{h/r} = 1$ implies $h^2/r > p^{2s}$ which is clearly
impossible. Thus $[g]_r^{h/r} = x \ne 1$. Since $\phi^r(x) = x$, the subgroup $F$ left
elementwise fixed by $\phi^r$ is not the identity. On the other hand, $[g]_r^{ph/r} = 1$
and so $h(h/r)p > p^{2s}$. It follows that $r < p$. Now $F$ is of index 0, and hence
every element of $F$ is of the form $\phi^i(y^j)$ for some element $y$ in $F$. But $\phi$ has
order $r$ on $F$, and consequently $rp > o(F)$. Since $r < p$, we conclude that
$F$ is cyclic, and the lemma is proved.

We are now ready to prove our main result concerning $\phi$-groups of order
$p^m q^n$.

LEMMA 4.2. If a $\phi$-group satisfies the conditions of Lemma 3.1, then $\phi$ leaves
some element other than the identity fixed.


Proof. Let $g$ be a generator of $G$ under $\phi$ of index $r$, and let $F$ be the
subgroup of $G$ of fixed elements under $\phi^r$. According to Lemma 3.2 either $F \subset P$,
$F = Q$, or $F = G$.


Case 1. $F \subset P$. Write $g = xy$, with $x$ in $P$, $y$ in $Q$. $P$ is normal in $G$, and
hence $[g]_r^j = x_j[y]_r^j$ for some $x_j$ in $P$. If $t$ is the least integer such that $[y]_r^t = 1$,
then $t$ is the least integer such that $[g]_r^t$ is in $P$, and hence $P$ is a $\phi$-group of
index $rt$. Moreover, since $Q$ is Abelian, it follows that $\phi^{rt}(y) = y$. But now
the subgroup of $Q$ left fixed elementwise by $\phi^{rt}$ is invariant under $\phi$ and
contains $y$, whence by our hypotheses it must equal $Q$. Thus the order $d$
of $\phi$ on $Q$ divides $rt$, the index of $P$. In view of Lemma 3.1, $P$ now satisfies
all the conditions of Lemma 4.1, and hence either $d = 1$, in which case $\phi$
is the identity on $Q$, or $d = 2$ and the subgroup $F_1$ of $P$ left elementwise
fixed by $\phi^{rt}$ is cyclic.

In the latter case, $\phi^r$ leaves only the identity element of $Q$ fixed, since
$F \subset P$, and hence $\phi^r$ has order 2 on $Q$. It follows that $\phi^r(z) = z^{-1}$ for all $z$
in $Q$. In particular this implies $t = 2$. Furthermore if $\psi_z$ denotes the
automorphism of $P$ induced by conjugation by an element $z$ in $Q$, we also have

















$\phi^{-2r}\psi_z\phi^{2r} = \psi_z$. If $F_1 = (x_1)$, we conclude at once that $\phi^{2r}(\psi_z(x_1)) = \psi_z(x_1)$,
whence $\psi_z(x_1) \in F_1$ for all $z$ in $Q$. Thus $F_1$ is normal in $G$, and being
invariant under $\phi$, $F_1 = P$, whence $o(P) = p$. Hence $m = 1$, contrary to the
fact that $d \mid m$ by Lemma 3.1.


Case 2. $F = Q$. Since $F \ne G$, $r \ne 0$. If $r = 1$, every element of $Q$ is left
fixed by $\phi$. Hence we may assume $r > 1$. We have $[y]_r^q = y^q = 1$, and hence
$x_q = [g]_r^q \in P$. Since $\phi^r$ is without non-trivial fixed elements on $P$, $[x_q]_r^{h/r} = 1$,
$[g]_r^{qh/r} = 1$, and $h^2q > r\,o(G)$ by Lemma 2.3. Since $h \mid d(p^{m/d} - 1)$, we have

$$(13) \qquad d^2(p^{m/d} - 1)^2 > rp^mq^{n-1}.$$

The only solutions of (13) are $d = 1$, in which case the lemma follows, or
$d = 2$ and $r = 2, 3$. If $d = 2$, $\phi^2$ leaves $Q$ elementwise fixed, while if $r = 3$,
$\phi^3$ leaves $Q$ elementwise fixed. Hence the case $d = 2$, $r = 3$ implies $\phi$ is the
identity on $Q$. In the remaining case $d = 2$, $r = 2$, we have $d \mid r$ and hence by
Lemma 4.1, the subgroup $F_1$ of $P$ left elementwise fixed by $\phi^{2q}$ is cyclic (since
$P$ is of index $2q$). This leads to a contradiction as in Case 1.


Case 3. $F = G$. This is the case $r = 0$. $P$ is also of index 0, so that $d$ divides
the index of $P$, whence by Lemma 4.1, $d = 1$. Thus $\phi$ is the identity on $Q$,
and the lemma is established.

We wish to point out that there do exist $\phi$-groups satisfying the conditions
of Lemma 3.1 in which $\phi$ leaves some non-trivial element of $G$ fixed. Perhaps
the simplest example is the symmetric group $S_3$ on three letters, which can
be defined by the relations $x^3 = y^2 = 1$ and $yxy^{-1} = x^{-1}$. It is easily checked
that $S_3$ is a $\phi$-group of index 1 with respect to the automorphism $\phi$ defined
by: $\phi(x^iy^j) = x^{-i}y^j$, the element $xy$ being a generator of $S_3$ under $\phi$.
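The verification for the symmetric group on three letters is short enough to carry out by machine. In the sketch below an element x^i y^j is encoded as the pair (i, j), with i taken modulo 3 and j modulo 2; this encoding and the names `mul`, `phi` and `bracket_elements` are assumptions introduced for the illustration.

```python
def mul(u, v):
    """(x^a y^s)(x^b y^t) = x^(a + (-1)^s b) y^(s + t), using y x y^-1 = x^-1."""
    (a, s), (b, t) = u, v
    return ((a + (-1) ** s * b) % 3, (s + t) % 2)


def phi(u):
    """The automorphism x^i y^j -> x^(-i) y^j."""
    i, j = u
    return ((-i) % 3, j)


def bracket_elements(g):
    """The elements [g]_1^1, [g]_1^2, ... up to the first one equal to 1."""
    out, current, image = [], g, g
    while True:
        out.append(current)
        if current == (0, 0):
            return out
        image = phi(image)
        current = mul(current, image)


chain = bracket_elements((1, 1))  # generator xy
```

Starting from xy the chain runs through xy, x^2, y, x, x^2 y, 1, so all six elements already occur among the products themselves; here h = 2 and k = 6, and the automorphism fixes the non-trivial element y.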


5. Solvable and non-exceptional $\phi$-groups. A group $G$ is called
exceptional if $G$ is a non-cyclic simple group in which the normalizer of every
characteristic subgroup $\ne 1$ of a $p$-Sylow subgroup $P$ of $G$ is $P$, for all primes
$p \mid o(G)$. It is easily shown that if $G$ is solvable or if every Sylow subgroup of
$G$ is Abelian, then no subgroup of $G$ has a composition factor which is an
exceptional group (2, Lemma 4.1).


THEOREM 1. Let $G$ be a regular $\phi$-group and assume that no subgroup of $G$
has a composition factor which is an exceptional group. Then $G$ is nilpotent.


Proof. The proof is by induction on the order of G, and consists in reducing 
to the case in which G satisfies the conditions of Lemma 3.1. This reduction 
is almost identical with that given by Feit (2, Lemma 4.2 and Theorem). 
However, as our group G need not be the regular subgroup of a Frobenius 
group, we shall outline the steps in this portion of the proof. 

We first show that $G$ contains a normal subgroup of prime power order
invariant under $\phi$. If $G$ has a proper characteristic subgroup $H$, $H$ is nilpotent
by induction, and any of its Sylow subgroups are normal in G and invariant 










under $\phi$. Otherwise $G$ is the direct product of isomorphic non-exceptional simple
groups. There exists then some $p \mid o(G)$ such that a $p$-Sylow subgroup $P$ of $G$
contains a characteristic subgroup $T$ such that $N(T) \supset P$. Since $\phi$ is a
Frobenius automorphism, some $p$-Sylow subgroup of $G$ is invariant under $\phi$,
and we may assume it to be $P$. Either $T$ is normal in $G$ (and $\phi(T) = T$) or
by induction $N(T)$ is nilpotent, $P$ is normal in $N(T)$, and hence $N(P) \supset P$.
Either the centre $C$ of $P$ is normal in $G$, and invariant under $\phi$ or $N(C)$ is
nilpotent.

If neither $C$ nor $T$ is normal in $G$, we have $N(C) \supseteq N(P) \supset P$. If $Q$ is
the unique $q$-Sylow subgroup of $N(C)$ for some prime $q \ne p$, and if $Q$ is not
normal in $G$, $N(Q)$ is nilpotent and contains $P$, whence $P$ and $Q$ commute
elementwise. If $C \subset x^{-1}Px$, then $Q \subset N(x^{-1}Cx)$, which is nilpotent, so that
$Q$ commutes elementwise with $x^{-1}Px$ as well as $P$. Since $N(Q)$ is nilpotent,
$x^{-1}Px = P$, and it follows that $G$ is $p$-normal. But $N(P)$ is also nilpotent,
so that by a theorem of Grün (6, p. 171) $G$ contains a normal subgroup $H$
such that $G/H \cong C$, contradicting the fact that $G$ is its own commutator
subgroup. Thus $G$ contains a normal subgroup of prime power order, invariant
under $\phi$.

Let $P$ be a minimal such subgroup so that $P$ is Abelian of type $(p, p, \ldots,
p)$. By induction $G/P$ is nilpotent. If $G$ is not a $p$-group, suppose $q$ is a prime
dividing $o(G)$, $q \ne p$; and let $Q$ be a minimal subgroup invariant under $\phi$
of a $q$-Sylow subgroup of $G$. If $PQ < G$, $PQ$ is nilpotent and this, together
with the fact that $G/P$ is nilpotent, implies that $Q$ is in the centre of $G$. But
then $G/Q$ and hence $G$ is nilpotent.

We may suppose therefore that $G = PQ$, the centre of $G$ is trivial, no
subgroup of $P$ invariant under $\phi$ is normal in $G$, and no subgroup of $Q$ is invariant
under $\phi$: precisely the hypotheses of Lemma 3.1. But now Lemma 4.2 implies
that there is no regular $\phi$-group which satisfies these conditions, and hence
$G$ is nilpotent.


COROLLARY. If the Sylow subgroups of a regular $\phi$-group $G$ are Abelian, then
$G$ is Abelian.


6. The fixed subgroup of φ^r. The subgroup left elementwise fixed by
φ^r plays an important role in determining the structure of a φ-group of index
r. In this section we shall determine some of the properties of this subgroup
for φ-groups of prime power order. We shall need the following lemma:


LEMMA 6.1. Let G be a φ-group of index 0 of order p^n having a generator g
of order p^m. Then G contains a sequence of characteristic subgroups G = G_m ⊃
G_{m-1} ⊃ ... ⊃ G_1 ⊃ G_0 = 1 where G_i is generated by the elements of order
p^i in G. Moreover, the subgroups G_i are the only subgroups of G invariant
under φ.


Proof. Since G is a φ-group of index 0, the elements φ^i(g^{jp^{m-1}}) clearly
include all elements of order p in G. Since no proper subset of these elements
forms a subgroup invariant under φ, they must form the characteristic subgroup
G_1 of elements of order dividing p in the centre of G. As pointed out, no
proper subgroup of G_1 is invariant under φ.

The lemma follows now easily by applying induction to the group G/G_1.


THEOREM 2. Let G be a regular φ-group of index r and order p^n, and let F be
the subgroup of G left elementwise fixed by φ^r. Then every subgroup of F
invariant under φ is normal in G.


Proof. Since φ^r leaves F elementwise fixed, F is of index 0, and hence by
the preceding lemma the elements of order p in F form a characteristic
subgroup F_1 of F. If F_1 is normal in G, the theorem follows by induction.
For if we set Ḡ = G/F_1, F̄ = the residue of F in Ḡ, and F̄' the subgroup of
elements left elementwise fixed by the image φ̄^r of φ^r, F̄ ⊂ F̄' and F̄' is
normal in Ḡ by induction. Since F̄ is invariant under φ̄, F̄ is characteristic
in F̄' by the preceding lemma, and hence normal in Ḡ. Thus F is normal in G
and the theorem follows at once.

We shall actually prove that F_1 lies in the centre of G. Let h be the order
of φ, let g be a generator of G under φ, and let

g_1 = [g]_r^{h_1/r}

be a generator of F_1, so that F_1 is of index h_1. To our induction
hypothesis we shall add the assertion that either h/h_1 or h_1/h is a power
of p.

Let us begin by verifying this statement under the assumption that F_1 is
in the centre of G. Let k be the order of φ̄ on Ḡ and let ḡ be the residue
of g in Ḡ. Let H be the set of elements of G left fixed by φ^k and suppose
first that H ⊅ F_1. Then H ∩ F_1 = 1 and hence φ^k is Frobenius on F_1. Thus
φ^k(g) = xg, x ∈ F_1 and x = y^{-1}φ^k(y) for some y in F_1. It follows that
φ^k(gy^{-1}) = gy^{-1}, whence gy^{-1} ∈ H. Thus g ∈ F_1H. Since F_1H is
invariant under φ and contains g, G = F_1H. Since H ≅ Ḡ, φ has order k on H.
If (r, k) = s, it follows that


(14) sa—, 


On the other hand, let H_1 be the subgroup of H generated by the elements
of order p left elementwise fixed by φ^r. Then F_1H_1 is left elementwise
fixed by φ^r and its elements all have order dividing p. It follows that
F_1H_1 = F_1. Since F_1 ∩ H_1 = 1, we conclude that H_1 = 1. Hence φ^r leaves
only the identity element of H fixed, and consequently φ̄^r leaves only the
identity element of Ḡ fixed. But this implies k/s is the least integer such
that [ḡ]_r^{k/s} = 1, and hence g_1 = [g]_r^{k/s}. Thus rk/s is the index of
F_1, and in view of (14) we conclude that h_1 = h.

Hence we may suppose H ⊃ F_1. In this case, the relation φ^k(g) = xg
implies φ^{kp}(g) = g, and we have

(15) k | h | kp.

Since r | h, it follows that either r = s or r = sp. If φ̄^r leaves only the
identity element of Ḡ fixed, it follows as above that h_1 = kr/s, and hence
h_1 | kp. We conclude from (15) that either h/h_1 or h_1/h is a power of p.

If φ̄^r is not Frobenius, let F̄_1 be the subgroup of Ḡ generated by the
elements of order p left elementwise fixed by φ̄^r. If k_1 is the index of
F̄_1, then by induction either k/k_1 or k_1/k is a power of p. By definition
of k_1,

[ḡ]_r^{k_1/r}

is a generator of F̄_1, and hence

g_2 = [g]_r^{k_1/r}

is a generator of the inverse image F_2 of F̄_1. Since φ̄^r leaves F̄_1
elementwise fixed and r | k_1,

φ^{k_1}(g_2) = zg_2

for some z in F_1. Since F_1 is in the centre of F_2, this implies

(16) [g_2]_{k_1}^j = z^{j(j-1)/2} g_2^j.

As p is the least value of j for which g_2^j ∈ F_1, it follows at once that
h_1 = k_1p. Thus h_1 = kp^ε for some integer ε. This together with (15) implies
that either h/h_1 or h_1/h is a power of p.
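
The computation behind (16) can be checked numerically in a toy case. The Python sketch below is an illustration only and is not part of the original argument. It assumes the additive group Z/p^2 with g = 1 and the automorphism x -> (1 + p)x standing in for φ^{k_1}, so that the element z = p is fixed, and it reads (16) as [g_2]_{k_1}^j = z^{j(j-1)/2} g_2^j. All function names are hypothetical.

```python
def bracket(p, j):
    """Additive analogue of g * phi(g) * ... * phi^(j-1)(g) in Z/p^2.

    Assumed toy model: g = 1 and phi(x) = (1 + p) * x mod p^2, so that
    phi(g) = g + z with z = p, and phi(z) = z.
    """
    modulus = p * p
    total, image = 0, 1
    for _ in range(j):
        total = (total + image) % modulus
        image = (image * (1 + p)) % modulus
    return total


def closed_form(p, j):
    """Additive form of z^(j(j-1)/2) * g^j with g = 1 and z = p."""
    return (j + p * (j * (j - 1) // 2)) % (p * p)


def least_j_in_subgroup(p):
    """Least j > 0 for which the bracket lies in the subgroup p * (Z/p^2)."""
    j = 1
    while bracket(p, j) % p != 0:
        j += 1
    return j
```

In this model the bracket agrees with the closed form for every j, and the least j that lands in the subgroup of order p is j = p, which mirrors the step from (16) to h_1 = k_1p.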

Finally we must show that F_1 does in fact lie in the centre of G. Let C be
a minimal subgroup of the centre of G invariant under φ. Because of the
minimality of F_1, either C = F_1 or C ∩ F_1 = 1. In the latter case, let Ḡ,
F̄_1, ḡ, φ̄ be respectively G/C, the image of F_1 and g in G/C, and the image
of φ on G/C. Let m be the order of φ̄ on Ḡ and define M to be the subgroup of
G left elementwise fixed by φ^m.

Now by induction, if m_1 is the index of F̄_1, we have

(17) m/m_1 = p^ε

for some integer ε.

By definition of m_1, r | m_1. If we write r = r_1p^a, where (r_1, p) = 1,
it follows that

(18) r_1 | m.

Since every element of F_1 is of the form φ^i(g_1^j), the order of φ on F_1
is relatively prime to p, and hence φ^m leaves F_1 elementwise fixed. It
follows therefore from (18) that F_1 ⊂ M.

Assume first that C ⊂ M, in which case CF_1 ⊂ M. Now the index of CF_1 =
index of F̄_1 = m_1. Let g' be a generator of CF_1 of index m_1 and write
g' = xy, x ∈ C, y ∈ F_1. If m | m_1,

[g']_{m_1}^j = g'^j = x^j y^j,

and every element of CF_1 is of the form φ^i(x^j y^j), which is clearly
impossible since C ∩ F_1 = 1. On the other hand, if m ∤ m_1, (17) holds with
ε > 0, and in this case

(19) [g']_{m_1}^j = [x]_{m_1}^j y^j.

To obtain an element of F_1, we must have

[x]_{m_1}^j = 1,

and this implies φ^{m_1 j}(x) = x since C is Abelian. If (j, p) = 1,

[g']_{m_1}^j = x^j y^j,

which is impossible as above. Since

φ^{m_1 p^ε}(x) = x,

(j, p) ≠ 1 implies p | j and hence 1 is the only element of F_1 which can be
written in the form

φ^i([g']_{m_1}^j),

contrary to the fact that g' is a generator of CF_1 under φ.

On the other hand, if C ∩ M = 1, it follows as in an earlier part of the
proof that G = CM. But M < G and F_1 ⊂ M so that by induction F_1 is in
the centre of M. Since C is in the centre of G, it follows that F_1 is in the
centre of G, and the proof is complete.


COROLLARY. If F_1 denotes the subgroup of F generated by the elements of order
p in F, then F_1 lies in the centre of G.


7. φ-groups in which φ^r leaves only the identity fixed. We shall also
need some properties of φ-groups of index r in which φ^r is a Frobenius
automorphism. To this end, we first establish the following lemma.


LEMMA 7.1. Let G be a regular φ-group of prime power order, and let C be a
subgroup of the centre of G, invariant under φ and of least possible order.
Then either C = G or [o(C)]^2 ≤ o(G).


Proof. We may suppose G ⊃ C. If Ḡ = G/C, we may restrict our attention
to a minimal subgroup of the centre of Ḡ, and hence without loss of generality
we may assume that Ḡ is Abelian of type (p, p, ..., p) and that no proper
subgroup of Ḡ is invariant under the image φ̄ of φ on Ḡ.

Let g be a generator of G, ḡ its image in Ḡ, k the order of φ̄, and H the
subgroup of G left elementwise fixed by φ^k. If H ∩ C = 1, it follows as in the
preceding section that G = CH. Since G/C ≅ H, H, and hence G, is Abelian.
But by definition of C, o(C) ≤ o(H) and therefore [o(C)]^2 ≤ o(G).

If, on the other hand, H ⊃ C, the equation φ^k(g) = yg implies φ^{kp}(g) = g
so that h | kp, where h is the order of φ. If h = k and φ^r leaves only the
identity element fixed, it follows as in the preceding section that the
identity is the only element of C which can be written in the form
φ^i([g]_r^j), which is a contradiction.

If h = k and some proper subgroup F of G is left elementwise fixed by
φ^r, either F ∩ C = 1 or F ⊇ C. In the first case, since no proper subgroup
of Ḡ is invariant under φ̄, it follows that G = CF, and hence G is Abelian
since F ≅ Ḡ is Abelian; and we have [o(C)]^2 ≤ o(G).

If F ⊃ C, then F̄ = Ḡ is of index 0, and hence C contains all elements of G
of order p. If x̄_1, x̄_2, ..., x̄_m are a basis of Ḡ, let x_1, x_2, ..., x_m
be a set of representatives such that φ(x_i) = x_{i+1}, i = 1, 2, ..., m − 1.
Then

φ(x_m) = z x_1^{a_1} x_2^{a_2} ... x_m^{a_m}.

Since x_i^p ∈ C for all i, it follows at once that x_1^p, x_2^p, ..., x_m^p
generate a subgroup C_1 of C invariant under φ. Since C is minimal, C_1 = C,
and hence o(C) ≤ p^m = o(Ḡ), which implies [o(C)]^2 ≤ o(G).

Finally if F = C, C is of index 0, whence the order of φ on G is a multiple
of (p^n − 1)/(p − 1), where p^n = o(C). This implies (p^n − 1)/(p − 1) | k.
But Ḡ is an Abelian group of type (p, p, ..., p) and hence k < o(Ḡ). Thus
o(C) = p^n ≤ o(Ḡ) and [o(C)]^2 ≤ o(G), as desired.

We now prove 


THEOREM 3. Let G be a regular φ-group of index r and assume φ^r leaves only
the identity element of G fixed. Then either some Sylow subgroup of G is Abelian
or there exists a proper subgroup G_1 in G, invariant under φ, which contains a
non-trivial subgroup of the centre of some p-Sylow subgroup of G for every prime
p | o(G).


Proof. If g is a generator of G under φ, and if h denotes as usual the order
of φ, we have first of all [g]_r^{h/r} = 1 and hence by Lemma 2.3

(20) h^2 ≥ o(G).

Let p_1, ..., p_t be the distinct primes dividing o(G), and let P_1, ..., P_t
be the corresponding Sylow subgroups of G invariant under φ. Let C_i be a
minimal subgroup of P_i, invariant under φ, and of lowest possible order. Then
if no Sylow subgroup of G is Abelian, the preceding lemma gives

(21) [o(C_i)]^2 ≤ o(P_i), i = 1, 2, ..., t.

Define s_i by the condition that

g_i = [g]_r^{s_i}

be a generator of C_i, and let h_i be the order of φ on C_i, i = 1, 2, ..., t.
Since C_i is an Abelian group of type (p_i, p_i, ..., p_i), we have

(22) h_i < o(C_i), i = 1, 2, ..., t.
Now let λ be the greatest common divisor of s_1, s_2, ..., s_t. We may assume
the s_i are so numbered that

(23) λ = Σ_{i=1}^{m} a_i s_i − Σ_{i=m+1}^{t} b_i s_i, where a_i, b_i ≥ 0.
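
A representation of the kind used in (23), writing the greatest common divisor as a difference of two non-negative integer combinations, can be produced by the extended Euclidean algorithm. The Python sketch below is an illustration under that assumption and is not taken from the paper. The function names are hypothetical, and the inputs are assumed to be positive integers.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g == gcd(a, b)."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


def bezout_split(values):
    """Return (lam, a, b) with lam = gcd(values) and

    lam == sum(a[i] * values[i]) - sum(b[i] * values[i]),

    where all a[i], b[i] are non-negative and at most one of a[i], b[i]
    is non-zero for each index i.
    """
    lam, coeffs = values[0], [1]
    for s in values[1:]:
        lam, x, y = extended_gcd(lam, s)
        coeffs = [c * x for c in coeffs] + [y]
    a = [max(c, 0) for c in coeffs]
    b = [max(-c, 0) for c in coeffs]
    return lam, a, b
```

Renumbering the s_i so that the indices with a positive coefficient come first gives the shape of (23).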


We now consider the elements

(24) x = [g_1]_{rs_1}^{a_1} φ^{ra_1s_1}([g_2]_{rs_2}^{a_2}) ... φ^{r(a_1s_1 + ... + a_{m-1}s_{m-1})}([g_m]_{rs_m}^{a_m}),
     y = [g_{m+1}]_{rs_{m+1}}^{b_{m+1}} φ^{rb_{m+1}s_{m+1}}([g_{m+2}]_{rs_{m+2}}^{b_{m+2}}) ... φ^{r(b_{m+1}s_{m+1} + ... + b_{t-1}s_{t-1})}([g_t]_{rs_t}^{b_t}).

By repeated use of Lemma 2.1, we find that

(25) x = [g]_r^u and y = [g]_r^v, where u = Σ_{i=1}^{m} a_i s_i, v = Σ_{i=m+1}^{t} b_i s_i,

and hence that z = y^{-1}x = φ^{rv}(g) φ^{r(v+1)}(g) ... φ^{r(u-1)}(g). It follows that

(26) z = φ^{rv}([g]_r^λ).

By construction z is a power product of elements of C_1, C_2, ..., C_t, and
hence we have φ^k(z) = z for some integer k | ∏_i h_i. Therefore

(27) φ^k([g]_r^λ) = [g]_r^λ with k | ∏_{i=1}^{t} h_i.


Now let G_1 be the subgroup of G invariant under φ which is generated by
[g]_r^λ. We prove that G_1 is a proper subgroup of G. Suppose, on the
contrary, that G_1 = G. Then φ^k is the identity on G by (27) and hence h | k.

But then combining (21), (22), and (23), we get

(28) h^2 ≤ ∏_{i=1}^{t} h_i^2 < ∏_{i=1}^{t} [o(C_i)]^2 ≤ ∏_{i=1}^{t} o(P_i) = o(G),

in contradiction to (20).

Since λ | s_i for all i, [g]_r^{s_i}, and hence C_i, is contained in G_1 for
all i = 1, 2, ..., t, and the theorem is proved.


COROLLARY. The same conclusion holds if we assume that h^2/r ≥ o(G) instead
of that φ^r leaves only the identity element of G fixed.


8. The structure of regular φ-groups. We are now in a position to
prove our main result.


THEOREM 4. Every regular φ-group is nilpotent.


Proof. Let G be a regular φ-group of index r, g a generator of G under φ,
and let k be the least integer such that [g]_r^k = 1. The proof will be by
induction on k.

If H is a proper subgroup of G invariant under φ, and s the least integer
such that z = [g]_r^s ∈ H, then clearly s | k, z is a generator of H of index
rs and [z]_{rs}^{k/s} = 1. Hence by induction H is nilpotent.

It suffices therefore, in view of Theorem 1, to prove that the normalizer
of a characteristic subgroup of some Sylow subgroup P of G contains P
properly. As in Theorem 1, we may suppose G contains no proper characteristic
subgroup; and hence that G is the direct product of isomorphic non-cyclic
simple groups, no subset of which is invariant under φ.

Let p_1, p_2, ..., p_t be the distinct primes dividing o(G), and let P_1, P_2,
..., P_t be the corresponding Sylow subgroups of G invariant under φ. If,
first of all, some P_i is Abelian, N(P_i) < G, and hence is nilpotent. Thus
P_i is in the centre of its normalizer, and it follows by a theorem of
Burnside that G contains a normal subgroup H such that G/H ≅ P_i, contrary to
the fact that G is its own commutator subgroup.


Thus no Sylow subgroup of G is Abelian. If φ^r left only the identity element
of G fixed, it would follow from Theorem 3 that there exists a proper subgroup
G_1 in G, invariant under φ, which contains for each i = 1, 2, ..., t a
subgroup C_i' of the centre of P_i. Since G_1 is nilpotent by induction and G
is not a p-group, N(C_i') ⊃ P_i. Now N(C_i') < G and so is nilpotent. If C_i
denotes the centre of P_i, it follows that N(C_i) ⊃ P_i, i = 1, 2, ..., t, and
by a previous remark this is sufficient to prove the nilpotency of G. Hence if
F denotes the subgroup of G left elementwise fixed by φ^r, F ⊃ 1.


Let g_i = [g]_r^{s_i} be a generator of P_i, i = 1, 2, ..., t and define F_i
to be the subgroup of G left elementwise fixed by φ^{rs_i}. Clearly F ⊂ F_i
for all i. Suppose first there is an index i, say i = 1, for which o(F_i) is
divisible by at least two distinct primes.
If Fi, < G, then F;, is nilpotent by induction. Let P,’ be the p,-Sylow 
subgroup of F invariant under @¢. As is easily seen, P;’ C P;. Suppose for some i 
p,jo(F). By Theorem 2, F; (\ P;is normal in P; and F (\ P, being invariant 
under ¢, is a characteristic subgroup of F; (\ P;, and hence is normal in F, ()\ P, 
Thus V(F (\ P,) > P, Since F, is nilpotent, it follows that F (\ P; is normal 
in Fi, and hence N(F (\ P,) > F,. It follows from our assumption on o(F;) 
that N(F(\P,) > P,. Since N(FC\P,) is nilpotent by induction, we 
conclude that N(C,) > P; which is sufficient to prove the nilpotency of G. 
Suppose instead that F_1 = G, so that φ^{rs_1} is the identity on G. If g_1
has order p_1^n, it follows that

[g_1]_{rs_1}^{p_1^n} = g_1^{p_1^n} = 1, whence [g]_r^{s_1p_1^n} = 1,

and consequently

(29) k = s_1p_1^n.

Since G is not solvable, the well-known theorem of Burnside implies
t ≥ 3. It follows from (29) that s_2 | s_1p_1^n and s_3 | s_1p_1^n. However
s_2 ∤ s_1, for this would imply that [g]_r^{s_1} ∈ P_1 ∩ P_2 = 1, which is not
the case. Similarly s_3 ∤ s_1, and hence

(30) p_1 | s_2, p_1 | s_3.
Let G_1 be the subgroup generated under φ by

g' = [g]_r^{p_1}.

By (30), G_1 ⊃ P_2 and G_1 ⊃ P_3. If G_1 < G, G_1 is nilpotent by induction,
and consequently N(C_2) ⊃ P_2, from which the nilpotency of G follows. On the
other hand, if G_1 = G, g' is a generator of G under φ of index rp_1 and

[g']_{rp_1}^{k'} = 1, where k' = k/p_1.

Since k' < k, the nilpotency of G follows by induction.

Finally we must consider the case in which each F_i is of prime power order.
Since F ⊂ F_i for all i, o(F_i) is a power of a single prime, say p_1, for all
i = 1, 2, ..., t. In particular, this implies F_i ∩ P_i = 1, i = 2, 3, ..., t,
and φ^{rs_i} leaves only the identity element of P_i fixed. Once again t ≥ 3.

It follows at once from the fact that [g]_r^k = 1 that φ^{rk}(g) = g, and
hence that (h/r) | k. Thus

(31) k = mh/r

for some integer m.

If m = 1, h(h/r) ≥ o(G), and it follows from the corollary of Theorem 3
that either some Sylow subgroup of G is Abelian or G contains a proper
subgroup G_1 satisfying the conditions of Theorem 3. Since both of these
cases have been treated above, we may assume m > 1.

On the other hand, since φ^{rs_i} leaves only the identity element of P_i
fixed,

[g_i]_{rs_i}^{h/r} = 1, i = 2, 3;

and hence

[g]_r^{s_ih/r} = 1.

It follows that

(32) m | s_i, i = 2, 3.

If now G*_1 is the subgroup of G generated under φ by g* = [g]_r^m, G*_1 ⊃ P_2
and G*_1 ⊃ P_3 in view of (32). If G*_1 < G, it follows as above that
N(C_2) ⊃ P_2 and that G is nilpotent; while if G*_1 = G, g* is a generator of
G of index rm,

[g*]_{rm}^{k*} = 1,

where k* = k/m, and G is nilpotent by induction.


9. The solvability of φ-groups. We now prove


THEOREM 5. Every φ-group is solvable.


Proof. Let G be a φ-group of index r with respect to a generator g, and let
h be the order of φ. As in § 2, we imbed G as a normal subgroup of a group G*
which satisfies the following conditions:

(33) G* = GA with G ∩ A = 1, aya^{-1} = φ(y)

for some element a in G* of order h and all y in G.

If

y = φ^i([g]_r^j)

is an arbitrary element of G, we can represent y in G* in the form

y = a^i [g (a^r g a^{-r}) (a^{2r} g a^{-2r}) ... (a^{(j-1)r} g a^{-(j-1)r})] a^{-i},

which reduces to

(34) y = a^i (ga^r)^j a^{-rj-i}.

Setting b = ga^r, every element of G can thus be expressed in the form
a^i b^j a^{-rj-i} for suitable choice of i and j. If x ∈ G*, x = ya^k for some
y in G and some integer k. It follows that

(35) if x ∈ G*, x = a^u b^v a^w for suitable integers u, v, w.
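
The reduction leading to (34) is a purely formal identity, valid in any group in which φ acts as conjugation by a. The Python sketch below is an illustration only and is not part of the original proof. It reads (34) as y = a^i (g a^r)^j a^{-rj-i}, represents group elements as permutations of {0, ..., n-1}, and compares φ^i([g]_r^j) with that closed form. The helper names are hypothetical.

```python
from itertools import product


def compose(p, q):
    """Return the permutation p*q, meaning: apply q first, then p."""
    return tuple(p[q[x]] for x in range(len(p)))


def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for x, y in enumerate(p):
        inv[y] = x
    return tuple(inv)


def power(p, n):
    """Return p^n for any integer n (negative n uses the inverse)."""
    if n < 0:
        p, n = inverse(p), -n
    result = tuple(range(len(p)))
    for _ in range(n):
        result = compose(result, p)
    return result


def phi(a, y, times=1):
    """Apply the inner automorphism y -> a y a^{-1} `times` times."""
    at = power(a, times)
    return compose(compose(at, y), inverse(at))


def bracket(a, g, r, j):
    """[g]_r^j = g * phi^r(g) * phi^{2r}(g) * ... * phi^{(j-1)r}(g)."""
    result = tuple(range(len(g)))
    for t in range(j):
        result = compose(result, phi(a, g, t * r))
    return result


def closed_form(a, g, r, i, j):
    """a^i * (g a^r)^j * a^{-rj-i}, the right-hand side of (34) as read here."""
    b = compose(g, power(a, r))
    return compose(compose(power(a, i), power(b, j)), power(a, -r * j - i))


def identity_holds(a, g, r_max=3, i_max=4, j_max=4):
    """Check phi^i([g]_r^j) == closed form for all small r, i, j."""
    return all(
        phi(a, bracket(a, g, r, j), i) == closed_form(a, g, r, i, j)
        for r, i, j in product(range(r_max + 1), range(i_max + 1), range(j_max + 1))
    )
```

For example, `identity_holds((1, 2, 3, 4, 0), (1, 0, 2, 3, 4))` tests the identity for a 5-cycle a and a transposition g.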

If φ leaves only the identity element of G fixed, G is nilpotent by Theorem 4;
and so we may assume that there is a subgroup H ≠ 1 in G which is left
elementwise fixed by φ.

Let g_1 = [g]_r^s be a generator of H, so that by (34)

(36) g_1 = b^s a^{-rs}.

Since φ(g_1) = g_1, we have ag_1a^{-1} = ab^sa^{-rs}a^{-1} = b^sa^{-rs}, whence

(37) ab^sa^{-1} = b^s.

Thus b^s commutes with a. Since b^s obviously commutes with b, it follows
from (35) that b^s is in the centre of G*. Let C* be the subgroup generated
by b^s, and set Ḡ* = G*/C*. Denoting the images of a, b, g, G in Ḡ*
respectively by ā, b̄, ḡ, Ḡ, it follows first of all that Ḡ is normal in Ḡ*,
and secondly that every element of Ḡ is of the form ā^i b̄^j ā^{-rj-i}, while
every element of Ḡ* is of the form ā^u b̄^v ā^w. If φ̄ denotes the automorphism
of Ḡ induced by conjugation by ā, we can reverse the steps in the derivation
of (34) to conclude that every element of Ḡ is of the form φ̄^i([ḡ]_r^j). Thus
Ḡ is a φ̄-group; and by definition of φ̄, we have

(38) φ̄(ȳ) = āȳā^{-1}, ȳ ∈ Ḡ.


Either φ̄ leaves only the identity element of Ḡ fixed or we may repeat
the process. Continuing this process we can always construct a sequence
of groups

G_i*, i = 1, 2, ..., n with G* = G_1*, Ḡ* = G_2*,

satisfying the following conditions:

(1) G_{i+1}* = G_i*/C_i*, where C_i* is a cyclic subgroup of the centre of
G_i*, i = 1, 2, ..., n − 1;

(2) G_n* is either the identity or contains a normal subgroup G_n such that
G_n*/G_n is cyclic;

(3) G_n is a φ_n-group in which φ_n leaves only the identity element of G_n
fixed.

By Theorem 4, G_n is nilpotent. Hence G_n*, and consequently G*, is solvable.
Since G ⊂ G*, it follows that G is solvable.


Remark. Not every ¢-group is nilpotent. An example of a non-nilpotent 
¢-group is the symmetric group on 3 letters, which was discussed at the end 
of § 4. 


10. φ-groups of index 0. In the next two sections we shall show that a
regular φ-group of prime power order is either Abelian or metabelian. In
view of Theorem 4 this will imply that a regular φ-group is nilpotent of
class ≤ 2.

In (4, Lemma 2), it has been shown that a regular φ-group of index 0
is Abelian if the order of φ is relatively prime to the order of a generator
of G under φ. In this section we shall establish the same result without making
any restrictions on the order of φ.

We have seen in Lemma 6.1 that a φ-group G of index 0 and of prime power
order contains a sequence of subgroups G = G_m ⊃ G_{m-1} ⊃ ... ⊃ G_1 ⊃ G_0 = 1,
where the G_i are the only subgroups of G invariant under φ, where each G_i
is normal in G, and where x^{-1}φ^r(x) = 1 if x ∈ G_i, i = 0, 1, 2, ..., m. For
later purposes we need to investigate φ-groups of prime power order which
contain such a sequence of subgroups G_i satisfying the first two of these
conditions together with the following weaker third condition: if x ∈ G_i, then

x^{-1}φ^r(x) ∈ G_{i-1}, i = 1, 2, ..., m.

We begin with the following lemma.


LEMMA 10.1. Let G be an Abelian φ-group of index r, of order p^{mn} and type
(p^n, p^n, ..., p^n). Denote by G_i the subgroup generated by the elements of
order p^i, and assume that for every x in G_i, x^{-1}φ^r(x) is in G_{i-1}. Then
if h is the order of φ, we have h | p^{n-1}(p^m − 1) and either n = 1 or m ≤ 2.


Proof. Let g be a generator of G under φ. If n = 1, G is of order p^m, of
type (p, p, ..., p), and g^{-1}φ^r(g) = 1, so that G is of index 0, and every
element of G is of the form φ^i(g^j). Hence the orbit under φ of g^j contains
exactly h elements, if 0 < j < p; if the number of distinct such orbits is k,
we have hk + 1 = p^m, whence h | p^m − 1.
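
The orbit count hk + 1 = p^m in the elementary Abelian case can be tried out on small examples. The Python sketch below is an illustration only, not the paper's construction. It assumes G is the additive group of vectors of length m over the integers mod p, that φ is given by an invertible matrix mod p, and that the chosen vector g generates G with index 0, which the code checks. All names are hypothetical.

```python
from itertools import product
from math import gcd


def apply_map(matrix, vec, p):
    """Apply the linear map given by `matrix` to `vec`, working mod p."""
    return tuple(
        sum(row[c] * vec[c] for c in range(len(vec))) % p for row in matrix
    )


def orbit(matrix, vec, p):
    """Orbit of `vec` under repeated application of an invertible matrix."""
    seen = [vec]
    cur = apply_map(matrix, vec, p)
    while cur != vec:
        seen.append(cur)
        cur = apply_map(matrix, cur, p)
    return frozenset(seen)


def orbit_count(p, matrix, g):
    """Return (h, k): the order h of the map and the number k of distinct
    orbits of the multiples j*g, 0 < j < p.

    Raises ValueError if those orbits do not cover every non-zero vector,
    that is, if g is not a generator of index 0 in the sense assumed here.
    """
    m = len(g)
    nonzero = {v for v in product(range(p), repeat=m) if any(v)}
    multiples = [tuple((j * c) % p for c in g) for j in range(1, p)]
    orbits = {orbit(matrix, v, p) for v in multiples}
    if set().union(*orbits) != nonzero:
        raise ValueError("g does not generate G with index 0")
    h = 1
    for v in nonzero:
        size = len(orbit(matrix, v, p))
        h = h * size // gcd(h, size)  # least common multiple of orbit sizes
    return h, len(orbits)
```

With p = 3 and the matrix [[0, 1], [1, 2]] (multiplication by a root of x^2 + x + 2 in the field of 9 elements) the map has order 8 and there is a single orbit, so 8·1 + 1 = 9. With p = 7 and multiplication by 2 on the cyclic group of order 7, h = 3 and k = 2, so 3·2 + 1 = 7.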

If n > 1, we proceed by induction to prove the first part of the lemma.
G_{n-1} is of type (p^{n-1}, p^{n-1}, ..., p^{n-1}), of order p^{m(n-1)}, and
is invariant under φ. Since φ^r(g) = gy, y ∈ G_{n-1} and φ^r(y) = yy',
y' ∈ G_{n-2}, it follows by a direct computation that

(39) [g]_r^j = y^{j(j-1)/2} y_1 g^j, where y_1 ∈ G_{n-2}.

From (39) it follows that the least value of j for which [g]_r^j is in G_{n-1}
is j = p, and that G_{n-1} is of index rp with respect to the generator
[g]_r^p. Since


[x~" 9" (x)? = xo (x), 











x ∈ G_i implies

x^{-1}φ^{rp}(x) ∈ G_{i-1}.

Hence we may apply induction to G_{n-1} to conclude that the automorphism

φ_1 = φ^{p^{n-2}(p^m − 1)}

leaves G_{n-1} elementwise fixed.

But then

(φ_1(g))^p = φ_1(g^p) = g^p,

whence φ_1(g) = gz, with z^p = 1. But then z ∈ G_{n-1}, φ_1(z) = z and
φ_1^p(g) = g. It follows that

φ_1^p = φ^{p^{n-1}(p^m − 1)}

is the identity on G.
For the second part of the lemma we need the statement:

(40) [g]_r^j ∈ G_{n-i} if and only if p^i | j.

We have proved (40) above for i = 1. If i > 1, set g_1 = [g]_r^p. Since g_1 is
a generator of G_{n-1} of index rp, it follows by induction that

[g_1]_{rp}^k ∈ G_{n-i}

if and only if p^{i-1} | k. But now by Lemma 2.1,

[g_1]_{rp}^k = [g]_r^{pk},

and (40) follows at once.

In particular, (40) implies that

[g]_r^{p^n} = 1

and that there are exactly p^n − p^{n-1} values of j < p^n for which [g]_r^j
has order p^n. For these values of j the elements φ^i([g]_r^j) must exhaust
the p^{mn} − p^{m(n-1)} elements of G of order p^n. Hence

h(p^n − p^{n-1}) ≥ p^{mn} − p^{m(n-1)}.

But h | p^{n-1}(p^m − 1), whence

(41) p^{n-1}(p^m − 1)(p^n − p^{n-1}) ≥ p^{mn} − p^{m(n-1)}.

It follows that p − 1 ≥ p^{mn-m-2n+2}, and we conclude from this inequality
that either n = 1 or m ≤ 2.
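
The last step is elementary arithmetic and can be confirmed by brute force. The Python sketch below is an illustration only. It reads inequality (41) as p^{n-1}(p^m − 1)(p^n − p^{n-1}) ≥ p^{mn} − p^{m(n-1)}, with G of type (p^n, ..., p^n) and order p^{mn}, which is an assumption about the intended exponents. The function name is hypothetical.

```python
def inequality_41_holds(p, m, n):
    """Evaluate (41), as read above, in exact integer arithmetic."""
    lhs = p ** (n - 1) * (p ** m - 1) * (p ** n - p ** (n - 1))
    rhs = p ** (m * n) - p ** (m * (n - 1))
    return lhs >= rhs
```

Dividing both sides by p^{2n-2}(p^m − 1) turns the inequality into p − 1 ≥ p^{(m-2)(n-1)}, which holds exactly when (m − 2)(n − 1) ≤ 0, that is, when n = 1 or m ≤ 2.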

The following theorem will be of considerable importance in determining
the structure of a regular φ-group.


THEOREM 6. Assume that a regular φ-group G of index r and order p^n contains
a sequence of normal subgroups G = G_m ⊃ G_{m-1} ⊃ ... ⊃ G_1 ⊃ G_0 = 1,
invariant under φ, such that no subgroup of G invariant under φ lies properly
between G_i and G_{i-1} and such that if x ∈ G_i, x^{-1}φ^r(x) ∈ G_{i-1},
i = 1, 2, ..., m. Then G is Abelian.



Proof. We shall first show that all the elements of order p in G are contained
in G_1, and hence that G_1 lies in the centre of G. Ḡ = G/G_1 satisfies all the
conditions of the lemma, and hence by induction the elements of order p in
Ḡ are contained in the subgroup Ḡ_2 = G_2/G_1 which is Abelian of type (p, p,
..., p). Hence the elements of order p in G are contained in G_2. Since G_1 is
a minimal subgroup of G invariant under φ, it is also Abelian of type (p, p,
..., p).

We have G_2 of index rs with respect to some generator g_2, g_2^p ∈ G_1, and
φ^{rs}(g_2) = yg_2 for some y in G_1. Since G_1 is normal in G_2, it follows
directly that

[g_2]_{rs}^j = z_j g_2^j, z_j in G_1.

If g_2 has order p^2, we conclude at once from this relation that the elements
of order p in G_2 are contained in G_1.

On the other hand, suppose g_2^p = 1. First of all, if G_1 were not in the
centre C of G_2, we would have G_1 ∩ C = 1 and G_1C/G_1 = Ḡ_2, since no proper
subgroup of Ḡ_2 is invariant under φ̄. But then G_2 = G_1C, and so G_2 is
Abelian. G_1 must therefore lie in the centre of G_2. But now if
φ^{rs}(g_2) = zg_2, z ∈ G_1, it follows that

(42) [g_2]_{rs}^j = z^{j(j-1)/2} g_2^j.

If p is odd, we conclude that [g_2]_{rs}^p = g_2^p = 1, a contradiction to the
fact that G_2 is spanned by the elements of the form

φ^i([g_2]_{rs}^j).

If p = 2, (42) gives [g_2]_{rs}^4 = 1, and so the orbits of the four elements
[g_2]_{rs}^j, j = 1, 2, 3, 4 must span G_2. But

[g_2]_{rs}^3 = zg_2 = φ^{rs}(g_2)

since g_2^2 = 1, and hence [g_2]_{rs}^1 and [g_2]_{rs}^3 determine the same
orbit. It follows that the orbit of g_2 under φ must include every element of
G_2 − G_1, whence

(43) h ≥ o(G_2) − o(G_1).

Since our assumptions imply that every element of G_2 is of order 2, G_2 is
Abelian and we may regard φ as a linear transformation of an n-dimensional
vector space (2^n = o(G_2)), over the field with 2 elements, which leaves some
t-dimensional subspace invariant (2^t = o(G_1)). But the maximum order of
such a linear transformation is easily computed to be (2^t − 1)(2^{n-t} − 1),
which is less than 2^n − 2^t, in contradiction to (43). Hence G_1 consists of
the elements of order dividing p in G, as asserted.
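
The comparison between the two quantities in the preceding paragraph is a one-line expansion: (2^t − 1)(2^{n-t} − 1) = 2^n − 2^t − 2^{n-t} + 1, which falls short of 2^n − 2^t by exactly 2^{n-t} − 1. The Python sketch below is an illustration only. It assumes the exponents are t and n − t with 0 < t < n, as read here, and it does not verify the claim about the maximum order of the transformation itself. The function name is hypothetical.

```python
def gap(n, t):
    """Return (2^n - 2^t) - (2^t - 1)(2^(n-t) - 1) for integers 0 < t < n."""
    return (2 ** n - 2 ** t) - (2 ** t - 1) * (2 ** (n - t) - 1)
```

The gap is positive whenever t < n, which is the strict inequality needed to contradict (43).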

If o(G_1) = p, G therefore has a unique subgroup of order p, and as is
well-known, this implies that G is either cyclic or isomorphic to the
generalized quaternion group of order 2^n. But this last group has a unique
element of order 2, which is necessarily fixed by every automorphism of the
group. Hence G is Abelian if G_1 is cyclic. We assume therefore that
o(G_1) = p^t with t > 1.

We consider the group Ḡ = G/G_1, and suppose k to be the order of the
image φ̄ of φ on Ḡ. If F denotes the subgroup of G left elementwise fixed
by φ^k, we have F ∩ G_1 = 1 or F ∩ G_1 = G_1, since G_1 is a minimal subgroup
of G invariant under φ and since F is also invariant under φ. Since G_1
contains every element of order p in G, F ∩ G_1 = 1 implies F = 1.

Consider the case F = 1. If g is a generator of G, φ^k(g) = z_1g, z_1 in G_1;
and since φ^k leaves only the identity element of G_1 fixed, z_1 = x^{-1}φ^k(x)
for some x in G_1, and φ^k(x^{-1}g) = x^{-1}g. Thus x^{-1}g ∈ F, whence g = x.
Thus G = G_1 is Abelian. We may thus suppose F ⊇ G_1.

Now Ḡ satisfies all the hypotheses of the theorem and is Abelian by
induction. But then it satisfies the conditions of Lemma 6.1, and consequently
is either cyclic, of type (p^{n-1}, p^{n-1}) or of type (p, p, ..., p) and order
p^m with k | p^m − 1. If Ḡ is cyclic, G is of course Abelian, since G_1 is in
its centre. In the second case, it follows that every element of G is of the
form xg^iφ(g)^j for some element x in G_1 and suitable integers i, j. Suppose
now that

(44) φ(g)g = ygφ(g), y in G_1.

Since Ḡ is of type (p^{n-1}, p^{n-1}), φ̄^2(ḡ) = ḡ^α φ̄(ḡ^β) for some integers
α, β, where ḡ denotes the image of g in Ḡ. Hence

(45) φ^2(g) = zg^αφ(g^β), z ∈ G_1.

Now apply φ to (44) and use (45) to obtain

(46) φ^2(g)φ(g) = φ(y)φ(g)φ^2(g) = φ(y)φ(g)zg^αφ(g^β)
                = φ(y)y^α(zg^αφ(g^β))φ(g) = φ(y)y^αφ^2(g)φ(g).

Hence φ(y) = y^{-α}, and the subgroup H generated by y is invariant under
φ. Since H ⊂ G_1, we have H = G_1 or H = 1. In the first case, o(G_1) = p,
contrary to our present assumption. Hence y = 1 and it follows at once
from (44) that G is Abelian.

There remains the case F ⊇ G_1, Ḡ Abelian of type (p, p, ..., p) and order
p^m, with k | p^m − 1. In this case the relation φ^k(g) = zg implies
φ^{kp}(g) = g, whence

(47) h | kp | (p^m − 1)p.

On the other hand, as in the proof of Lemma 10.1

[g]_r^{p^2} = 1,

and hence

(48) hp^2 ≥ o(G) = o(G_1)o(Ḡ) = p^{t+m}.

Combining (47) and (48), we get the inequality (p^m − 1)p^3 ≥ p^{t+m}, which
implies t ≤ 2. We are assuming t > 1 and hence t = 2.

The theorem has already been proved if m ≤ 2. Hence we may assume
m ≥ 3. Let ȳ_1, ȳ_2, ..., ȳ_m be a basis for Ḡ and y_1, y_2, ..., y_m a set of
representatives in G. Since G_1 contains all elements of G of order p,
y_i^p ≠ 1 for all i. Since y_i^p ∈ G_1 and G_1 is of type (p, p), there exist
integers γ_1 and γ_2 such that


(49) ys = yT'yt”. 
On the other hand, if 


aoe 


ys(yi'y2?) = xuyi'ys'ys and yryi = xoyi' yr, 
then 


(50) (ya yi ys)” - A PtPP (ys My") yh - (20 59) 7 Ot ys ys Py’. 


If p is odd, it follows at once from (49) and (50) that y.~%y,-7y, has 
order p and hence is in G;. We conclude that 9; = §,"9."*, which implies 
m <2, a contradiction. 

On the other hand, if p = 2, and ḡ is the residue of g in Ḡ, it follows as
in the first part of the proof that [g]_r^4 = 1, that [g]_r^3 and [g]_r^1
determine the same orbits, and hence that the orbit of g under φ must include
all 2^2(2^m − 1) elements of G − G_1, and hence

(51) h ≥ 2^2(2^m − 1).

On the other hand, since every element of G_1 is of the form φ^i(g_1^j), φ has
order 3 on G_1. Since F ⊇ G_1, 3 | k. But then φ^k(g) = xg, x ∈ G_1, implies
φ^{2k}(g) = g, whence h | 2k. Since k ≤ 2^m − 1, h ≤ 2(2^m − 1) in
contradiction to (51).


COROLLARY. A regular φ-group of index 0 and of prime power order is
Abelian.


The structure of φ-groups of index 0 is now easily obtained.


THEOREM 7. A regular φ-group of index 0 is Abelian.


Proof. If G is of index 0, so is every one of its subgroups. Since φ is regular,
φ leaves some p-Sylow subgroup of G invariant for every p | o(G). It follows
from the preceding corollary that the Sylow subgroups of G are all Abelian,
and hence by the corollary of Theorem 1, that G is Abelian.


11. The structure of regular φ-groups of prime power order.


THEOREM 8. A regular φ-group of prime power order is either Abelian or
metabelian.


Proof. Let G be a regular φ-group of index r and order p^n. We shall first
prove that G contains a normal subgroup F* invariant under φ and of index
rs such that

(a) F* satisfies the hypotheses of Theorem 6.

(b) Ḡ = G/F* is Abelian of type (p, p, ..., p).

(c) The image φ̄^r of φ^r leaves only the identity element of Ḡ fixed.

(d) If k = order of φ̄, then (k, p) = 1 and k | rs.

We shall then show that F* is actually in the centre of G.

Suppose first that φ^r leaves some proper subgroup F of G elementwise
fixed. By Theorem 2, F is normal in G. Let Ḡ = G/F. By induction Ḡ contains
a subgroup F̄* of index rs such that F̄*, Ḡ/F̄*, and the image of φ̄ on
Ḡ/F̄* satisfy conditions (a) to (d). If F* denotes the inverse image of F̄* in
G, F* is of index rs. Since F is of index 0, it follows readily from Lemma 6.1
and condition (a) for F̄* that F* satisfies (a). Since G/F* ≅ Ḡ/F̄*, the
remaining conditions follow at once.

We may therefore assume that φ^r leaves only the identity element of G
fixed. If G is Abelian of type (p, p, ..., p) and (h, p) = 1, where h = order
of φ, set F* = 1.

If G is not of this form, let C_1 be a minimal subgroup of the centre of G,
invariant under φ, and set Ḡ = G/C_1, φ̄ = image of φ on Ḡ. If Ḡ is Abelian
of type (p, p, ..., p) and the order m of φ̄ is relatively prime to p, we let
H be the subgroup of elements of G left elementwise fixed by φ^m. If
H ∩ C_1 = 1 it follows by the usual argument that G = C_1H, that H ≅ Ḡ, and
consequently that G is Abelian of type (p, p, ..., p). Since C_1 is a minimal
subgroup of G invariant under φ, the order of φ on C_1 is relatively prime to
p, and it follows at once that (h, p) = 1, a contradiction. Thus H ⊇ C_1.

Let g be a generator of G of index r and g_1 = [g]_r^s a generator of C_1 of
index rs. If ḡ is the residue of g in Ḡ, [ḡ]_r^s = 1, and since φ̄ leaves only
the identity element of Ḡ fixed, it follows that φ̄^{rs}(ḡ) = ḡ. Thus m | rs.
Since H ⊇ C_1 we conclude that x^{-1}φ^{rs}(x) = 1 for all x in C_1. If we put
F* = C_1, it is clear that conditions (a) to (d) hold.

Consider then the case in which either Ḡ is not Abelian of type (p, p, ..., p)
or (m, p) ≠ 1 so that Ḡ contains at least one proper normal subgroup invariant
under φ̄. By induction Ḡ contains a proper normal subgroup F̄* of index rs
such that if G̿ = Ḡ/F̄*, φ̿ = image of φ̄ on G̿, and k = order of φ̿, then F̄*
satisfies the conditions of Theorem 6, G̿ is Abelian of type (p, p, ..., p),
(k, p) = 1 and k | rs, and φ̿^r is Frobenius. Our conditions imply that
φ̄^{rs}(ḡ) = z̄ḡ, z̄ ∈ F̄*. It follows as in the derivation of (39) and (40) that

φ̄^{rsp^n}(ḡ) = ḡ

for some integer n, and hence

(52) m | rsp^n.

Let H be the subgroup of G left elementwise fixed by φ^{rsp^n}, and suppose
first that H ⊇ C_1. Let F* be the inverse image of F̄* in G. The index of F*
= index of F̄* = rs. Furthermore φ^{rsp^n}(x) = x for all x in C_1. Since C_1 is
a minimal subgroup of G invariant under φ, the order of φ on C_1 is relatively
prime to p, and hence x^{-1}φ^{rs}(x) = 1 for all x in C_1. It follows
immediately that F* satisfies (a). Since G/F* ≅ Ḡ/F̄*, (b), (c), and (d) also
hold.

On the other hand, if $H \cap C_1 = 1$, it follows once again that $G = C_1H$ and that $\bar G \cong H$ under an isomorphism $\tau$ such that $\tau(\bar\phi(\bar x)) = \phi(\tau(\bar x))$ for all $\bar x$ in $\bar G$. Let $F'$ be the normal subgroup of $H$ which corresponds to $\bar F^*$ under $\tau$. Then $F'$ is invariant under $\phi$, $\phi$ has order $m$ on $H$ and $m \mid rsp^n$. Let $F_1$ be a minimal subgroup of $F'$ invariant under $\phi$. Since every subgroup of $F'$ invariant under $\phi$ is characteristic in $F'$, $F_1$ is normal in $H$ and hence also in $G$. Let $G' = G/F_1$, $\phi'$ = image of $\phi$ on $G'$, $m'$ = order of $\phi'$. By induction $G'$ contains a normal subgroup $F'^*$ of index $rs'$ such that conditions (a), (b), (c) hold for $F'^*$ and $\bar G' = G'/F'^*$. In particular, $m' = rs'p^{n'}$ for some integer $n'$. Let $H_1$ be the subgroup of $G$ invariant under

$\phi^{rs'p^{n'}}$.

Since $\bar F^*$ is the homomorphic image of $C_1F'$, $C_1F'$ is of index $rs$. Since $F_1 \subset C_1F'$, it follows that $rs \mid rs'$, and hence $H_1 \supset F_1$. Our desired conclusion now follows as in the preceding paragraph.

It remains to prove that $F^*$ lies in the centre of $G$. By construction $F^*$ contains a sequence of normal subgroups $F^* = F_n \supset F_{n-1} \supset \cdots \supset F_1 \supset F_0 = 1$ invariant under $\phi$ such that

$x^{-1}\phi^{rs}(x) \in F_{i-1}$ if $x \in F_i$,

and such that no proper subgroup of $F^*$ invariant under $\phi$ lies properly between $F_i$ and $F_{i-1}$. By Theorem 6, $F^*$ is Abelian. It is easy to see that this implies that $F^*$ is of type $(p^n, p^n, \ldots, p^n)$ and that $F_i$ is the subgroup generated by the elements of order $p^i$ in $F^*$. Thus $F_1$ is characteristic in $F^*$, and consequently normal in $G$. Since $F_1$ is a minimal subgroup of $G$ invariant under $\phi$, we conclude that $F_1$ lies in the centre of $G$.

Since $\bar G$ is Abelian of type $(p, p, \ldots, p)$ we can decompose $\bar G$ into the direct product of subgroups $\bar G_j$, $j = 1, 2, \ldots, t$, invariant under $\bar\phi^{rs}$ and none of which can be further decomposed into proper subgroups invariant under $\bar\phi^{rs}$. If $G_j$ denotes the inverse image of $\bar G_j$, it suffices to prove that $F^*$ lies in the centre of each $G_j$. For definiteness, take $j = 1$.

First of all, if $\bar\phi^{rs}$ has non-trivial fixed elements on $\bar G_1$, it follows from the minimality of $\bar G_1$ that $\bar\phi^{rs}$ is in fact the identity on $\bar G_1$. Hence if $x \in G_1$, $x^{-1}\phi^{rs}(x) \in F^*$. It follows at once that $G_1$ is a group of index $rs$ satisfying the conditions of Theorem 6, and hence is Abelian. Thus $F^*$ is in the centre of $G_1$ in this case.

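Consider then the case in which $\bar\phi^{rs}$ leaves only the identity element of $\bar G_1$ fixed. $\bar G_1$ has a basis $\bar y_1, \ldots, \bar y_q$ such that

$\bar\phi^{rs}(\bar y_i) = \bar y_{i+1}$, $i = 1, 2, \ldots, q - 1$

and

(53) $\bar\phi^{rs}(\bar y_q) = \bar y_1^{a_1}\bar y_2^{a_2} \cdots \bar y_q^{a_q}$

for suitable integers $a_1, a_2, \ldots, a_q$.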










If

$\bar y_1^{i_1}\bar y_2^{i_2} \cdots \bar y_q^{i_q}$

is a fixed element of $\bar\phi^{rs}$, then it is easily checked that the integers $i_1, i_2, \ldots, i_q$ are a solution of the congruences

(54) $a_1i_q \equiv i_1$, $a_2i_q + i_1 \equiv i_2$, $\ldots$, $a_qi_q + i_{q-1} \equiv i_q \pmod p$,

and conversely. It follows readily from (54) that $\bar\phi^{rs}$ is Frobenius on $\bar G_1$ if and only if

(55) $(a_1 + a_2 + \cdots + a_q - 1, p) = 1$.
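The step from (54) to (55) can be checked by brute force. Adding the $q$ congruences of (54) gives $(a_1 + \cdots + a_q - 1)\,i_q \equiv 0 \pmod p$, so the trivial solution is the only one exactly when $p$ does not divide $a_1 + \cdots + a_q - 1$. The Python sketch below enumerates the solutions directly. The function names and the sample values of $p$ and the $a_j$ are illustrative assumptions, not taken from the paper, and "Frobenius" is read here only as "no non-trivial fixed element".

```python
from itertools import product
from math import gcd

def fixed_exponent_vectors(a, p):
    """Solve the congruences (54) by exhaustive search (hypothetical helper).

    a = (a_1, ..., a_q) are the exponents in (53). The result lists every
    (i_1, ..., i_q) mod p with
        a_1*i_q = i_1  and  a_j*i_q + i_{j-1} = i_j  (j = 2..q)  (mod p).
    """
    q = len(a)
    solutions = []
    for i in product(range(p), repeat=q):
        ok = (a[0] * i[-1] - i[0]) % p == 0
        ok = ok and all((a[j] * i[-1] + i[j - 1] - i[j]) % p == 0
                        for j in range(1, q))
        if ok:
            solutions.append(i)
    return solutions

def satisfies_55(a, p):
    """Condition (55): gcd(a_1 + ... + a_q - 1, p) = 1."""
    return gcd(sum(a) - 1, p) == 1

# Illustrative data only: p = 3, q = 2.
print(fixed_exponent_vectors((1, 1), 3), satisfies_55((1, 1), 3))
print(fixed_exponent_vectors((2, 2), 3), satisfies_55((2, 2), 3))
```

For a prime $p$, the search returns only the zero vector exactly when `satisfies_55` holds, which is the equivalence asserted in the text.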
Let $y_i$ be a representative of $\bar y_i$ in $G_1$ such that

$\phi^{rs}(y_i) = y_{i+1}$, $i = 1, 2, \ldots, q - 1$.

Then

$\phi^{rs}(y_q) = x_0y_1^{a_1}y_2^{a_2} \cdots y_q^{a_q}$,

$x_0 \in F^*$. If $\psi_i$ denotes the automorphism of $F^*$ induced by conjugation by $y_i$, $\psi_i$ leaves $F_1$ elementwise fixed since $F_1$ lies in the centre of $G_1$. We shall prove by induction on $n$ that $\psi_i$ leaves $F^*$ elementwise fixed. This will suffice to prove that $F^*$ is in the centre of $G_1$, and will complete the proof of the theorem. By induction $F^*/F_1$ lies in the centre of $G/F_1$, whence

(56) if $x \in F^*$, $\psi_i(x) = zx$, $z \in F_1$.
Suppose $\psi_i$ is the identity on $F_k$ with $1 \le k < n$. We shall prove $\psi_i$ is the identity on $F_{k+1}$. Applying $\phi^{rs}$ to (56), we obtain

(57) $\phi^{rs}(\psi_i(x)) = \phi^{rs}(y_i)\,\phi^{rs}(x)\,\phi^{rs}(y_i^{-1}) = \phi^{rs}(z)\,\phi^{rs}(x) = z\,\phi^{rs}(x)$.

But if $x \in F_{k+1}$, $\phi^{rs}(x) = xz_1$, $z_1 \in F_k$. Since $F_k$ is in the centre of $G_1$, we conclude from (57) that

(58) $\psi_{i+1}(x) = zx = \psi_i(x)$ for all $i$ and all $x$ in $F_{k+1}$.

By repeated use of (58) we now obtain

$\psi_1(x) = \psi_q(x) = (x_0y_1^{a_1}y_2^{a_2} \cdots y_q^{a_q})(x)(x_0y_1^{a_1}y_2^{a_2} \cdots y_q^{a_q})^{-1} = \psi_1^{a_1}\psi_2^{a_2} \cdots \psi_q^{a_q}(x)$,

whence

(59) $\psi_1^{a_1 + a_2 + \cdots + a_q - 1}(x) = x$, $x \in F_{k+1}$.

On the other hand, (56) implies $\psi_1^p(x) = x$. But then (55) and (59) together imply $\psi_1(x) = x$ for all $x$ in $F_{k+1}$. Q.E.D.
Theorem 8 and Theorem 4 together imply 


THEOREM 9. A regular $\phi$-group is either Abelian or nilpotent of class 2.

We conjecture that a regular $\phi$-group is Abelian if $\phi^r$ is Frobenius. This result would follow easily from the following conjecture concerning fixed-point free automorphisms of $p$-groups.







CONJECTURE. Let $G$ be a non-Abelian $p$-group which admits an automorphism $\phi$ of order $h$ leaving only the identity element of $G$ fixed, and assume that $G$ cannot be expressed as the direct product of two proper subgroups invariant under $\phi$.
Then h® < o(G). 


12. The relation between $\phi$-groups and groups of the form $ABA$.

In the proof of the preceding theorem we have already seen that a $\phi$-group $G$ can be imbedded as a normal subgroup of an $ABA$-group $G^*$ satisfying $G^* = GA$ and $G \cap A = 1$. The converse is also true, and consequently we have


THEOREM 10. $G$ is a $\phi$-group if and only if it can be imbedded as a normal subgroup of a group $G^*$ of the form $ABA$, where $A$ and $B$ are cyclic subgroups of $G^*$, such that $G^* = GA$ and $A \cap G = 1$.


Proof. It suffices to prove that if an $ABA$-group $G^*$ in which $A$, $B$ are cyclic contains a normal subgroup $G$ such that $G^* = GA$ and $G \cap A = 1$, then $G$ is a $\phi$-group.

If $a$, $b$ are generators of $A$, $B$ respectively, we have $b = ga^r$ for some element $g$ in $G$ and some integer $r$. Since $G$ is normal, the elements

(60) $b^ja^{-jr} = (ba^{-r})(a^rba^{-2r}) \cdots (a^{(j-1)r}ba^{-jr}) = g(a^rga^{-r}) \cdots a^{(j-1)r}ga^{-(j-1)r}$

are in $G$ for all $i$, $j$.

Suppose for some $j$, $b^ja^k \in G$; then $a^{-k-jr} = (b^ja^k)^{-1}(b^ja^{-jr}) \in G \cap A$. Since $G \cap A = 1$, $a^k = a^{-jr}$. It follows at once that $G$ consists precisely of the elements of $G^*$ of the form $a^ib^ja^{-jr-i}$. If we now define $\phi$ to be an automorphism of $G$ induced by conjugation by $a$, it follows as in the proof of Theorem 5 that every element of $G$ is of the form $\phi^i([g]_r^j)$. Thus $G$ is a $\phi$-group of index $r$ and with generator $g$.

Combining Theorems 5 and 10, we obtain our final result: 


THEOREM 11. A group $G^*$ which is of the form $ABA$, where $A$ and $B$ are cyclic subgroups, and which contains a normal subgroup $G$ such that $G^* = GA$ and $G \cap A = 1$ is solvable.


In a subsequent paper we shall show that an $ABA$-group $G^*$ with a trivial centre in which $A$ is its own normalizer and $A$ is of odd order always contains a normal subgroup $G$ such that $G^* = GA$ and $G \cap A = 1$. We shall also determine the structure of $G^*$ when $o(A)$ is even and, in particular, shall show that $G^*$ is solvable.










REFERENCES 


1. W. Burnside, Theory of groups of finite order (Dover, New York, 1955). 

2. W. Feit, On the structure of Frobenius groups, Can. J. Math., 9 (1957), 587-96. 

3. D. Gorenstein, A class of Frobenius groups, Can. J. Math., 11 (1959), 39-42. 

4. D. Gorenstein and I. N. Herstein, A class of solvable groups, Can. J. Math., 11 (1959), 
311-20. 

5. G. Higman, Groups and rings which have automorphisms without non-trivial fixed elements, 
J. London Math. Soc., 32 (1957), 321-334. 

6. H. Zassenhaus, The theory of groups (Chelsea, New York, 1958). 


Clark University 


ON THE CORPOIDAL RADICAL OF A RING (SUR LE RADICAL CORPOIDAL D'UN ANNEAU)
G. THIERRIN 


The radical of a ring, in the general sense of N. Jacobson (1; 2), can be characterized in several ways. One can, for example, define it as the intersection of all the primitive ideals of the ring in question. In this paper we consider a particular class of primitive ideals, the class of corpoidal ideals: an ideal is called corpoidal if and only if the corresponding quotient ring is a division ring.¹ The main object of this paper is to give some characterizations of the intersection $C$ of all the corpoidal ideals of a ring $A$, an intersection called the corpoidal radical of the ring $A$. If $C$ is distinct from $A$, the quotient ring $A/C$ is isomorphic to a subdirect sum of division rings. In the case of a commutative ring, every primitive ideal is corpoidal, and consequently its corpoidal radical coincides with its radical.²


1. Interversive rings and modules. A ring $A$ is called right interversive (interversif à droite) if

$abA = baA$ for all $a, b \in A$.

An ideal $H$ of a ring $A$ is called right interversive if the quotient ring $A/H$ is right interversive. It is easy to see that an ideal $H$ is right interversive if and only if for every triple $a, b, c \in A$ there exists $x \in A$ such that

$abc \equiv bax \ (H)$.

It follows that every ideal containing a right interversive ideal is right interversive. In particular, every ideal of a right interversive ring is right interversive.
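The defining condition $abA = baA$ can be tested exhaustively on a small finite ring. The Python sketch below is only an illustration: the helper `is_right_interversive`, the encoding of triangular matrices as triples, and the two sample rings are assumptions introduced here, not part of the paper.

```python
from itertools import product

def is_right_interversive(elements, mul):
    """Check a*b*A == b*a*A (as sets) for all a, b in a finite ring A."""
    for a, b in product(elements, repeat=2):
        ab, ba = mul(a, b), mul(b, a)
        if {mul(ab, x) for x in elements} != {mul(ba, x) for x in elements}:
            return False
    return True

# Sample ring 1 (assumed example): the commutative ring Z/6Z.
z6 = list(range(6))

def mul_z6(x, y):
    return (x * y) % 6

# Sample ring 2 (assumed example): upper triangular 2x2 matrices over F_2,
# encoded as (p, q, r) for the matrix [[p, q], [0, r]].
t2 = list(product(range(2), repeat=3))

def mul_t2(x, y):
    p, q, r = x
    s, t, u = y
    return ((p * s) % 2, (p * t + q * u) % 2, (r * u) % 2)

print(is_right_interversive(z6, mul_z6))
print(is_right_interversive(t2, mul_t2))
```

Every commutative ring passes the test trivially. In the triangular ring the pair $a = e_{11}$, $b = e_{12}$ gives $ab = e_{12}$ and $ba = 0$, so $abA \neq baA$ and the test fails.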

Recall that a ring $A$ is called a right reflector (réflecteur à droite) if, for every right ideal $H$ of $A$, the relation $ab \in H$ implies $ba \in H$ (3).


THEOREM 1. For a ring $A$ with an identity element to be right interversive, it is necessary and sufficient that it be a right reflector.


The condition is necessary. If $ab \in H$, where $H$ is a right ideal of $A$, we have $abA = baA \subseteq H$. Hence $ba \in H$.

The condition is sufficient. Let $a, b, c \in A$. Since $A$ is a right reflector and has an identity element, there exists, by (3), an element $x$ such that $ab = bax$. Consequently $abc = baxc$, and the ideal $(0)$ is right interversive.


Received December 15, 1958.
¹ By a division ring (corps) we mean a ring whose set of nonzero elements forms a group under multiplication.
² The nomenclature of this paper is that of N. Jacobson (1).

Note that every right ideal of a right interversive ring with an identity element is a two-sided ideal.

An $A$-module³ $M$ is called interversive if

$uabA = ubaA$ for all $u \in M$, $a, b \in A$.

If the ring $A$ is right interversive, every $A$-module is evidently interversive.


THEOREM 2. For a ring $A$ to be a division ring, it is necessary and sufficient that there exist an irreducible, faithful and interversive $A$-module $M$.


The condition is necessary. Indeed, it suffices to take $M = A$.
The condition is sufficient. Since the $A$-module $M$ is irreducible, if $0 \neq u \in M$, we have, by (1, ch. I), $M = uA \cong A - (0 : u)$, where $(0 : u) = J$ is a maximal modular right ideal. Let $t \in J$ and $x \in A$; we have $txA \subseteq J$ and $utxA = uxtA = 0$. Hence $xtA \subseteq J$. The set

$J\,\cdot\!.\,A = \{a \mid a \in A,\ aA \subseteq J\}$

is a right ideal of $A$ and $J \subseteq J\,\cdot\!.\,A$. We therefore have either $J\,\cdot\!.\,A = J$ or $J\,\cdot\!.\,A = A$. If $J\,\cdot\!.\,A = A$, we have $A^2 \subseteq J$, which is impossible since $J$ is modular. Consequently $J\,\cdot\!.\,A = J$. Since $xt \in J\,\cdot\!.\,A$, we have $xt \in J$ and so $J$ is a two-sided ideal. Let $N = \{v \mid v \in M,\ vJ = 0\}$. This set $N$ is a submodule of $M$ and $u \in N$. Hence $N \neq 0$ and, since $M$ is irreducible, $N = M$, which implies $J \subseteq (0 : M)$. Since $M$ is faithful, $(0 : M) = 0$. Consequently $J = 0$ and $A$ is a division ring.


COROLLARY. For a ring $A$ to be a division ring, it is necessary and sufficient that it be primitive and right interversive.


The condition is evidently necessary. It is sufficient. Indeed, since the ring $A$ is primitive, there exists an irreducible and faithful $A$-module $M$. Since $A$ is right interversive, the module $M$ is interversive, and so $A$ is a division ring by the theorem.

An ideal $K$ of a ring $A$ is called corpoidal if the quotient ring $A/K$ is a division ring. It is easy to see that an ideal is corpoidal if and only if it is a maximal modular right ideal.


THEOREM 3. For an ideal $K$ of a ring $A$ to be corpoidal, it is necessary and sufficient that there exist an irreducible and interversive $A$-module $M$ such that $K = (0 : M)$.

The condition is necessary. Since the quotient ring $A/K$ is a division ring, there exists, by Theorem 2, an irreducible, faithful and interversive $A/K$-module $M$. But this $A/K$-module $M$ can also be regarded as an $A$-module.








³ In this paper, the term "module" always means right module.


The $A$-module $M$ is likewise irreducible, and $K = (0 : M)$. Let us show that the $A$-module $M$ is interversive. Let $a, b \in A$ and $u \in M$. If $u = 0$, evidently $uabA = ubaA = 0$. Let $u \neq 0$. If $ab \in K$, then $ba \in K$. From $abA \subseteq K$ and $baA \subseteq K$ it follows that $uabA = ubaA = 0$. If $ab \notin K$, then $ba \notin K$, $abA \not\subseteq K$ and $baA \not\subseteq K$. Since $K$ is a maximal right ideal, we have

$A = K + abA = K + baA$.

Since the element $u$ is nonzero and $M$ is an irreducible $A$-module, $uA = M$. Hence

$uA = uK + uabA = uK + ubaA = M$.

Since $uK = 0$, we consequently have

$uabA = ubaA = M$.

The condition is sufficient. Indeed, $M$ can be regarded as an irreducible and faithful $A/K$-module, since $K = (0 : M)$ and $M$ is an irreducible $A$-module. On the other hand it is easy to see that $M$, regarded as an $A/K$-module, is also interversive. Consequently the quotient ring $A/K$ is, by Theorem 2, a division ring.


2. Corpoidal radical. Let $A$ be an arbitrary ring and $\Sigma$ the set of irreducible and interversive $A$-modules $M_\alpha$. The kernel $C$ of $\Sigma$, that is, the set $C = \bigcap\{(0 : M_\alpha) \mid M_\alpha \in \Sigma\}$, is called the corpoidal radical of $A$. If $\Sigma$ is empty, the corpoidal radical of $A$ is, by definition, $A$ itself.

If $R$ is the radical⁴ of $A$, $R \subseteq C$. If $A$ is right interversive, $R = C$.


THEOREM 4. If it is distinct from $A$, the corpoidal radical $C$ of a ring $A$ is the intersection of the corpoidal ideals of $A$.


This is immediate from Theorem 3.
A ring $A$ is called c-semi-simple if $A \neq 0$ and the corpoidal radical of $A$ reduces to 0.


THEOREM 5. If the corpoidal radical $C$ of the ring $A$ is distinct from $A$, the quotient ring $A/C$ is c-semi-simple.


Since the intersection of the corpoidal ideals of $A$ is $C$ by Theorem 4, it follows easily that the intersection of the corpoidal ideals of $A/C$ reduces to zero. Hence $A/C$ is c-semi-simple.


THEOREM 6. A ring $A$ is isomorphic to a subdirect sum of division rings if and only if it is c-semi-simple.


If $A$ is c-semi-simple, the ideal $(0)$ is the intersection of the corpoidal ideals $K_\alpha$ of $A$. Consequently $A$ is isomorphic to a subdirect sum of the quotient rings $A/K_\alpha$, which are division rings. Conversely, it is immediate that if $A$ is isomorphic to a subdirect sum of division rings, $A$ is c-semi-simple.


⁴ The notions of radical and semi-simplicity of a ring are taken, in this paper, in the general sense of Jacobson (1, 2).
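Theorems 4 and 6 can be illustrated in the commutative ring $\mathbf{Z}/n\mathbf{Z}$, where the corpoidal ideals are exactly the maximal ideals $p\mathbf{Z}/n\mathbf{Z}$ with $p$ a prime divisor of $n$. The Python sketch below computes their intersection; the function names and the sample moduli are assumptions chosen for illustration, not part of the paper.

```python
def prime_divisors(n):
    """Distinct prime divisors of n, by trial division."""
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes

def corpoidal_radical_zn(n):
    """Residues mod n lying in every maximal ideal pZ/nZ (p prime, p | n)."""
    primes = prime_divisors(n)
    return [x for x in range(n) if all(x % p == 0 for p in primes)]

def is_c_semi_simple_zn(n):
    """Z/nZ (n > 1) is c-semi-simple when its corpoidal radical is {0}."""
    return n > 1 and corpoidal_radical_zn(n) == [0]

print(corpoidal_radical_zn(12))   # residues divisible by both 2 and 3
print(is_c_semi_simple_zn(30))    # 30 is squarefree
print(is_c_semi_simple_zn(12))
```

For squarefree $n$ the radical is zero and, by the Chinese remainder theorem, $\mathbf{Z}/n\mathbf{Z}$ is a direct (hence subdirect) sum of the fields $\mathbf{Z}/p\mathbf{Z}$, in line with Theorem 6.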


3. c-quasi-regular elements. If $a$ is a fixed element of a ring $A$, we denote, following Jacobson (1) (even if $A$ has no identity element), by $(1 - a)A$ the right ideal $\{x - ax \mid x \in A\}$ and by $A(1 - a)$ the left ideal $\{x - xa \mid x \in A\}$.
An element $z$ of $A$ is called c-quasi-regular if the right ideal $(1 - z)A$ is contained in no corpoidal ideal of $A$.


THEOREM 7. An element $z$ of $A$ is c-quasi-regular if and only if the left ideal $A(1 - z)$ is contained in no corpoidal ideal of $A$.


The proof of this theorem follows immediately from the following lemma:


LEMMA 1. If $K$ is a corpoidal ideal of the ring $A$, the relations $x - ax \in K$ and $x - xa \in K$ are equivalent.


Let us show, for example, that $x - ax \in K$ implies $x - xa \in K$. If $x \in K$, this is immediate. If $x \notin K$, there exist, since $K$ is corpoidal, elements $e$ and $x'$ such that $ye \equiv y \ (K)$ for every $y \in A$ and $xx' \equiv e \ (K)$. From $x - ax \in K$ it follows that $x \equiv ax \ (K)$, $xx' \equiv axx' \ (K)$, $e \equiv ae \equiv a \ (K)$. Hence $x \equiv xe \equiv xa \ (K)$, that is, $x - xa \in K$.


Note that every right or left quasi-regular element is c-quasi-regular.


THEOREM 8. An element $z$ of $A$ is c-quasi-regular if and only if the ideal $A$ is the only right interversive ideal containing $(1 - z)A$.


Suppose that $z$ is c-quasi-regular. If there exists a right interversive ideal $H$ different from $A$ containing $(1 - z)A$, this ideal $H$ is modular, hence contained in a maximal (modular) right ideal $J$. The ideal $P = J\,.\!\cdot\,A = \{a \mid a \in A,\ Aa \subseteq J\}$ is primitive and $H \subseteq P$. Since $H$ is right interversive, so is $P$. Consequently the quotient ring $A/P$ is primitive and right interversive, hence a division ring by the corollary to Theorem 2. The ideal $P$ is therefore corpoidal and contains $(1 - z)A$, contrary to hypothesis. Conversely, it is immediate that if $A$ is the only right interversive ideal containing $(1 - z)A$, there is no corpoidal ideal containing $(1 - z)A$, since every corpoidal ideal is right interversive.

A right ideal of $A$ is called c-quasi-regular if all its elements are c-quasi-regular.


THEOREM 9. The corpoidal radical $C$ of a ring $A$ is a c-quasi-regular ideal containing every c-quasi-regular right ideal.


Let $z \in C$. If $z$ is not c-quasi-regular, $(1 - z)A$ is contained in a corpoidal ideal $K$. By Theorem 4, $z \in K$. If $x$ is an arbitrary element of $A$, we have $x - zx \in K$ and hence $x \in K$. Hence $K = A$, which is impossible. Consequently every element of $C$ is c-quasi-regular.

Let $T$ be a c-quasi-regular right ideal and let $z \in T$. The element $zx$ is c-quasi-regular for every $x \in A$. If $z \notin C$, there exists an irreducible and interversive $A$-module $M$ such that $z \notin (0 : M)$. Put $K = (0 : M)$; $K$ is a corpoidal ideal by Theorem 3. There exists $u \in M$ such that $uz \neq 0$, and $uzA = M$ since $M$ is irreducible. Consequently there exists $a \in A$ such that $uza = u$. Since the element $za$ is c-quasi-regular, $(1 - za)A \not\subseteq K$. The ideal $K$ is a maximal right ideal; hence $(1 - za)A + K = A$. There therefore exist $x \in A$ and $k \in K$ such that $x - zax + k = -za$, that is, $za + x - zax = -k \in K$. On the other hand $0 = u - uza - (u - uza)x = u - u(za + x - zax) = u + uk$. From $k \in K$ it follows that $uk = 0$ and hence $u = 0$, which is impossible since $uz \neq 0$. Consequently $z \in C$ and $T \subseteq C$.
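Theorem 9 can be checked numerically in $\mathbf{Z}/n\mathbf{Z}$. There $(1 - z)A$ lies in the maximal ideal $p\mathbf{Z}/n\mathbf{Z}$ exactly when $p$ divides $1 - z$, so $z$ is c-quasi-regular when no prime divisor of $n$ divides $1 - z$. The Python sketch below compares, for each $z$, "the ideal $zA$ is c-quasi-regular" with "$z$ lies in the corpoidal radical". All function names and the sample modulus are illustrative assumptions, not part of the paper.

```python
def prime_divisors(n):
    """Distinct prime divisors of n, by trial division."""
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes

def is_c_quasi_regular(z, n):
    """(1 - z)A lies in no maximal ideal pZ/nZ, i.e. no prime p | n divides 1 - z."""
    return all((1 - z) % p != 0 for p in prime_divisors(n))

def principal_ideal(z, n):
    """The ideal zA of Z/nZ as a sorted list of residues."""
    return sorted({(z * x) % n for x in range(n)})

def in_corpoidal_radical(z, n):
    """z lies in every maximal ideal pZ/nZ."""
    return all(z % p == 0 for p in prime_divisors(n))

n = 12  # sample modulus, chosen only for illustration
for z in range(n):
    ideal_is_cqr = all(is_c_quasi_regular(w, n) for w in principal_ideal(z, n))
    print(z, ideal_is_cqr, in_corpoidal_radical(z, n))
```

The two printed truth values agree for every $z$, as the theorem predicts: an ideal consists of c-quasi-regular elements exactly when it sits inside the corpoidal radical.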


4. Interversion kernel. A complex⁵ $H$ of an arbitrary ring $A$ is called a right interversion complex if $abH = baH$ for all $a, b \in A$. The zero element is a right interversion complex. Every minimal right ideal $D$ which is a two-sided ideal is a right interversion complex; this follows from the fact that then $xD = 0$ or $xD = D$ for every $x \in A$.

It is easy to see that a complex $H$ is a right interversion complex if and only if for every pair $a, b \in A$ and every $h \in H$ there exists $h' \in H$ such that $abh = bah'$.

The union $T$ of all the right interversion complexes of $A$ is called the right interversion kernel of $A$.


THEOREM 10. If $H$ and $K$ are right interversion complexes of the ring $A$, the complexes $HA$, $AH$, $H^* = \{-h \mid h \in H\}$ and $H + K = \{h + k \mid h \in H,\ k \in K\}$ are right interversion complexes.


From $abH = baH$ for all $a, b \in A$ it follows that $abHA = baHA$. Hence $HA$ is a right interversion complex.

Next let $vh \in AH$, with $v \in A$, $h \in H$. There exist $h' \in H$ such that $abvh = bvah'$ and $h'' \in H$ such that $vah' = avh''$. Hence $abvh = bvah' = bavh''$, with $vh'' \in AH$. Consequently $AH$ is a right interversion complex.

The complex $H^*$ is a right interversion complex since, if $h \in H$, there exists $h' \in H$ such that $abh = bah'$; hence $ab(-h) = ba(-h')$. Finally let us show that $H + K$ is a right interversion complex. If $h \in H$, $k \in K$, there exist $h' \in H$, $k' \in K$ such that $abh = bah'$ and $abk = bak'$. Hence $ab(h + k) = ba(h' + k')$.


THEOREM 11. The right interversion kernel $T$ of a ring $A$ is a right interversion complex, a two-sided ideal and a right interversive ring.


⁵ By a complex of a ring $A$ we mean any nonempty subset of $A$.


Let $a, b \in A$ and $t \in T$. There exists a right interversion complex $H$ containing the element $t$. Consequently there exists $t' \in H \subseteq T$ such that $abt = bat'$, which shows that $T$ is a right interversion complex.

Let $x, y \in T$. From Theorem 10 and the fact that $T$ is a right interversion complex containing all the right interversion complexes of $A$, we deduce that $-y \in T$ and $x - y \in T$, which shows that $T$ is an additive subgroup. Since the complexes $AT$ and $TA$ are, by Theorem 10, right interversion complexes, we have $AT \subseteq T$ and $TA \subseteq T$. Consequently $T$ is a two-sided ideal.

It is immediate that $T$ is a right interversive ring.
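For a finite ring the right interversion kernel can be computed by pruning. The Python sketch below is not the paper's method: it starts from the whole ring and discards any $h$ for which some product $abh$ has no witness $h'$ with $abh = bah'$ among the surviving elements. Every right interversion complex survives each pass, so the loop stops at their union. The sample ring (upper triangular $2 \times 2$ matrices over $\mathbf{F}_2$, encoded as triples) and all names are assumptions made for illustration.

```python
from itertools import product

# Assumed sample ring: upper triangular 2x2 matrices over F_2,
# encoded as (p, q, r) for the matrix [[p, q], [0, r]].
T2 = list(product(range(2), repeat=3))

def mul(x, y):
    """Matrix product in the encoding above, entries reduced mod 2."""
    p, q, r = x
    s, t, u = y
    return ((p * s) % 2, (p * t + q * u) % 2, (r * u) % 2)

def interversion_kernel(elements, mul):
    """Largest subset H with: for all a, b and h in H, abh = bah' for some h' in H."""
    kernel = set(elements)
    changed = True
    while changed:
        changed = False
        for h in sorted(kernel):
            for a, b in product(elements, repeat=2):
                targets = {mul(mul(b, a), k) for k in kernel}
                if mul(mul(a, b), h) not in targets:
                    kernel.discard(h)   # h has no witness h' left
                    changed = True
                    break
    return kernel

T = interversion_kernel(T2, mul)
print(sorted(T))
```

In this ring the kernel consists of the four matrices whose second row is zero, and one can check directly that it is closed under multiplication by ring elements on both sides, as Theorem 11 asserts.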


THEOREM 12. If $R$ is the radical and $T$ the right interversion kernel of a ring $A$, the corpoidal radical of the ring $T$ is the set $T \cap R$.


Since the kernel $T$ is a two-sided ideal, its radical is, by Jacobson (1, ch. I), the set $T \cap R$. Since $T$ is a right interversive ring, its corpoidal radical coincides with its radical.


COROLLARY. If it is distinct from zero, the right interversion kernel of a semi-simple ring is isomorphic to a subdirect sum of division rings.


REFERENCES


1. N. Jacobson, Structure of rings, Amer. Math. Soc. Coll. Pub., 37, 1956. 

2. ———— The radical and semi-simplicity for arbitrary rings, Amer. J. Math., 67 (1945), 
300-320. 

3. G. Thierrin, Contribution à la théorie des anneaux et des demi-groupes, Comm. Math. Helv., 32 (1957), 93-112.


Université de Montréal 








ON REPRESENTATIONS OF ORDERS OVER 
DEDEKIND DOMAINS 


D. G. HIGMAN 


We study representations of $\mathfrak{o}$-orders $\mathfrak{O}$, that is, of $\mathfrak{o}$-regular $\mathfrak{o}$-algebras, in the case that $\mathfrak{o}$ is a Dedekind domain. Our main concern is with those $\mathfrak{O}$-modules, called $\mathfrak{O}$-representation modules, which are regular as $\mathfrak{o}$-modules. For any $\mathfrak{O}$-module $M$ we denote by $D(M)$ the ideal consisting of the elements $x \in \mathfrak{o}$ such that $x \cdot \mathrm{Ext}^1(M, N) = 0$ for all $\mathfrak{O}$-modules $N$, where $\mathrm{Ext} = \mathrm{Ext}_{(\mathfrak{O},\mathfrak{o})}$ is the relative functor of Hochschild (5). To compute $D(M)$ we need the small amount of homological algebra presented in § 1. In § 2 we show that the $\mathfrak{O}$-representation modules with rational hulls isomorphic to direct sums of right ideal components of the rational hull $A$ of $\mathfrak{O}$, called principal $\mathfrak{O}$-modules, are characterized by the property that $D(M) \neq 0$. The $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-modules are those with $D(M) = \mathfrak{o}$. We observe that $D(M)$ divides the ideal $I(\mathfrak{O})$ of (2) for every $M$, and give another proof of the fact that $I(\mathfrak{O}) \neq 0$ if and only if $A$ is separable. Up to this point, $\mathfrak{o}$ can be taken to be an arbitrary integral domain.

The results of the remaining sections are largely generalizations of Maranda's results for groups (6, 7). In §§ 3-5 we assume that $\mathfrak{o}$ is a local domain with prime ideal $\mathfrak{p}$, and define the depth of an $\mathfrak{O}$-module $M$ to be $s$ or $\infty$ according as $D(M) = \mathfrak{p}^s$ or $0$. In § 3 we generalize Maranda's Theorem 2 of (6) by proving that an $\mathfrak{O}$-representation module $M$ of depth $s$ is isomorphic with an $\mathfrak{O}$-representation module $N$ if and only if $M/\mathfrak{p}^{s+1}M$ and $N/\mathfrak{p}^{s+1}N$ are isomorphic. In § 4 it is proved, among other things, that for complete $\mathfrak{o}$, an $\mathfrak{O}$-representation module $M$ has depth $s$ if and only if $M/\mathfrak{p}^{s+1}M$ has depth $s$. This implies, for example, that $M$ is $(\mathfrak{O}, \mathfrak{o})$-projective if and only if $M/\mathfrak{p}M$ is $(\mathfrak{O}/\mathfrak{p}\mathfrak{O}, \mathfrak{o}/\mathfrak{p})$-projective, a slight improvement of a result of Reiner (8), since the "only if" part does not require a special hypothesis. In § 5, $\mathfrak{o}$ is assumed complete, and the $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation modules are characterized as being isomorphic with direct sums of indecomposable right ideal components of $\mathfrak{O}$. A generalization of Maranda's Theorem 4 of (7) states that if $I(\mathfrak{O}/\mathfrak{R}) = 0$, two $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation modules are isomorphic if and only if their rational hulls are isomorphic. Here $\mathfrak{R}$ is the intersection of $\mathfrak{O}$ with the radical of $A$.

In the final § 6 we apply the local results to the case of a general Dedekind domain $\mathfrak{o}$, observing that for a principal $\mathfrak{O}$-module $M$, $D(M) = \prod_{\mathfrak{p}} \mathfrak{p}^{s_{\mathfrak{p}}}$, where the product is over all primes $\mathfrak{p}$ of $\mathfrak{o}$, and $s_{\mathfrak{p}}$ is the depth of $M$ in the $\mathfrak{p}$-adic completion of $\mathfrak{o}$. We denote by $S_M$ a complete set of non-isomorphic $\mathfrak{O}$-representation modules $N$ with rational hulls isomorphic to that of $M$, and such that $D(N) = D(M)$. Although simple examples show that there may be infinitely many non-isomorphic $\mathfrak{O}$-representation modules with rational hulls isomorphic to a given indecomposable right ideal component of $A$, it seems possible that the cardinal $r(M)$ of $S_M$, which we call the class number of $M$, is finite if the class number $h$ of $\mathfrak{o}$ is finite. As was pointed out in (2), Maranda's method for the group case (7) can be extended to prove this if $A$ is separable and the rational hull of $M$ is absolutely irreducible. Two members of $S_M$ are placed in the same genus if they are isomorphic in the $\mathfrak{p}$-adic completion of $\mathfrak{o}$ for all primes $\mathfrak{p}$ of $\mathfrak{o}$. We denote the number of genera in $S_M$ by $g(M)$, and the number of classes in $S_M$ under isomorphism in the $\mathfrak{p}$-adic completion of $\mathfrak{o}$ by $r_{\mathfrak{p}}(M)$. The final result of this paper is that $g(M) \le \prod r_{\mathfrak{p}}(M)$, the product extending over the prime divisors of $D(M) \cap I(\mathfrak{O}/\mathfrak{R})$, with the consequence that $g(M)$ is finite when $\mathfrak{o}$ has finite residue class rings.


Received August 26, 1957; in revised form May 21, 1959.


1. The ideals D(M) and C(M). We need a small amount of homological 
algebra. For the basic notations and definitions of this subject we refer the 
reader to (1 and 5). Throughout this paper, rings will be assumed to have 
identity elements, identity elements of rings and subrings will be assumed to 
coincide, and modules will be right unitary unless otherwise specified. 

Let $Q$ be a $K$-subalgebra of a $K$-algebra $P$, where $K$ is a commutative ring with identity element. For a $P$-module $M$, we define ideals $D(M) = D_{(P,Q)}(M)$ and $C(M) = C_{(P,Q)}(M)$ by

$D(M) = \{x \in K \mid x \cdot \mathrm{Ext}^1(M, N) = 0$ for all $P$-modules $N\}$

and

$C(M) = \{x \in K \mid x \cdot \mathrm{Ext}^1(N, M) = 0$ for all $P$-modules $N\}$,

where $\mathrm{Ext} = \mathrm{Ext}_{(P,Q)}$ is the relative functor introduced by Hochschild (5). According to (5), $M$ is $(P, Q)$-projective (injective) if and only if $\mathrm{Ext}^1(M, N) = 0$ ($\mathrm{Ext}^1(N, M) = 0$) for all $P$-modules $N$, hence

LEMMA 1. $D(M) = K$ ($C(M) = K$) if and only if $M$ is $(P, Q)$-projective (injective).

The result we use for computing $D(M)$ in the applications to orders is the following.


LEMMA 2. An element $x \in K$ belongs to $D(M)$ if and only if there exists a $P$-homomorphism $\beta : M \to M \otimes_Q P$ such that $\beta\tau = x \cdot I_M$, where $\tau : M \otimes_Q P \to M$ is the natural homomorphism, and $I_M$ is the identity map of $M$.
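A standard illustration of Lemma 2, not taken from this paper, is the group algebra of a finite group. The setting $P = KG$, $Q = K$ and the explicit map $\beta$ below are assumptions made for the example; maps are written on the right, as in the lemma.

```latex
% Assumed setting (not from the text): P = KG for a finite group G, Q = K.
\[
  \beta : M \to M \otimes_K KG, \qquad
  m\beta = \sum_{g \in G} m g^{-1} \otimes g .
\]
% beta is a P-homomorphism: for h in G, substitute g = g'h.
\[
  (mh)\beta = \sum_{g \in G} m h g^{-1} \otimes g
            = \sum_{g' \in G} m g'^{-1} \otimes g' h
            = (m\beta) h .
\]
% Composing with the natural map tau : m \otimes p \mapsto mp gives
\[
  m\beta\tau = \sum_{g \in G} m g^{-1} g = |G|\, m ,
  \qquad\text{so}\qquad
  \beta\tau = |G| \cdot I_M \ \text{ and } \ |G| \cdot 1_K \in D(M).
\]
```

Under these assumptions the lemma recovers the familiar fact that the group order annihilates $\mathrm{Ext}^1_{(KG,K)}(M, N)$ for every $KG$-module $M$.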


Before proving this we recall that $\mathrm{Ext}^1(M, N)$ can be computed as the first cohomology group of the $K$-complex $\mathrm{Hom}_P(X, N)$ where $X$ is the left $P$-complex determined by the standard $(P, Q)$-projective resolution of $M$. This resolution is the $(P, Q)$-exact sequence

$\cdots \xrightarrow{\pi_2} K_1 \otimes_Q P \xrightarrow{\pi_1} M \otimes_Q P \xrightarrow{\tau} M \to 0$

obtained by composing the natural $(P, Q)$-exact sequences

$0 \to K_1 \xrightarrow{\eta} M \otimes_Q P \xrightarrow{\tau} M \to 0$, $\quad 0 \to K_2 \xrightarrow{\eta_2} K_1 \otimes_Q P \xrightarrow{\tau_1} K_1 \to 0$, $\ldots$.

In particular, $\pi_2 = \tau_2\eta_2$, so $\pi_2\tau_1 = 0$, and hence $\tau_1$ is a 1-cocycle for $\mathrm{Hom}_P(X, K_1)$.

We shall now prove the following lemma, and then derive Lemma 2 as a Corollary.

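LEMMA 3.

$D(M) = \{x \in K \mid x \cdot \mathrm{Ext}^1(M, K_1) = 0\} = \{x \in K \mid x \cdot \tau_1$ is a coboundary$\}$.


Proof. For $a \ge 0$, $K_a \otimes_Q P$ is $(P, Q)$-projective, hence there corresponds to each $g \in \mathrm{Hom}_P(K_a \otimes_Q P, K_1)$ an element $g' \in \mathrm{Hom}_P(K_a \otimes_Q P, K_1 \otimes_Q P)$ such that $g'\tau_1 = g$. If $g'\tau_1 = 0$, then $\mathrm{Im}\, g' \subseteq \mathrm{Ker}\, \tau_1 = \mathrm{Im}\, \pi_2$. Hence for a 1-cocycle $f$ of $\mathrm{Hom}_P(X, N)$, $N$ a $P$-module, we have $g'f = 0$. Therefore mapping $g$ onto $g'f$ defines a map $\mu_{f,a} : \mathrm{Hom}_P(K_a \otimes_Q P, K_1) \to \mathrm{Hom}_P(K_a \otimes_Q P, N)$. These maps are readily seen to define a $K$-map $\mathrm{Hom}_P(X, K_1) \to \mathrm{Hom}_P(X, N)$. Since we may take $\tau_1'$ to be the identity map of $K_1 \otimes_Q P$, $\mu_{f,1}(\tau_1) = f$. Since $f$ can be taken as an arbitrary 1-cocycle of $\mathrm{Hom}_P(X, N)$, the lemma follows.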


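Proof of Lemma 2. For $P$-modules $A$ and $B$, we shall denote by $*$ the natural $K$-isomorphism $\mathrm{Hom}_Q(A, B) \cong \mathrm{Hom}_P(A \otimes_Q P, B)$.
Let

$0 \to M \xrightarrow{\kappa} M \otimes_Q P \xrightarrow{\lambda} K_1 \to 0$

be a $Q$-homotopy for the sequence

$0 \to K_1 \xrightarrow{\eta} M \otimes_Q P \xrightarrow{\tau} M \to 0$,

$\kappa$ being the natural homomorphism.

If now $x \in D(M)$, there exists by Lemma 3 an element $g \in \mathrm{Hom}_Q(M, K_1)$ such that $x \cdot \tau_1 = \pi_1g^*$. Then, since $\kappa^*$ is the identity map of $M \otimes_Q P$, and $(g\eta)^* = g^*\eta$,

$0 = [x \cdot \tau_1 - \pi_1g^*]\eta = x\tau_1\eta - \pi_1g^*\eta = \pi_1[x\kappa^* - g^*\eta] = \pi_1[(x \cdot \kappa) - (g\eta)]^*$.

It follows that $\beta = x \cdot \kappa - g\eta$ is an element of $\mathrm{Hom}_P(M, M \otimes_Q P)$. Further, $\beta\tau = [(x \cdot \kappa\tau) - g\eta\tau] = x \cdot I_M$.

On the other hand, suppose such a $\beta$ exists. Let $g = [x \cdot \kappa - \beta]\lambda \in \mathrm{Hom}_Q(M, K_1)$. Since $[x \cdot \kappa - \beta]\tau = 0$, $g\eta = x \cdot \kappa - \beta$. Further, $\pi_1\beta^* = \pi_1\tau\beta = 0$. Hence

$\pi_1g^*\eta = \pi_1(g\eta)^* = \pi_1[x \cdot \kappa^* - \beta^*] = (x \cdot \tau_1)\eta$,

so that $\pi_1g^* = x \cdot \tau_1$ and $x \in D(M)$ by Lemma 3.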










The result corresponding to Lemma 2 for C(M) is 


LEMMA 2'. An element $x \in K$ belongs to $C(M)$ if and only if there exists a $P$-homomorphism $\gamma \in \mathrm{Hom}_P(\mathrm{Hom}_Q(P, M), M)$ such that $\xi\gamma = x \cdot I_M$, where $\xi : M \to \mathrm{Hom}_Q(P, M)$ is the natural homomorphism.


This can be proved in the same way as Lemma 2 by first proving the result 
corresponding to Lemma 3, using the standard (P, Q)-injective resolution of 
M in place of the projective one. In the following when we state a property of 
C(M) we shall omit the proof if it is similar to a proof of a corresponding 
property of D(M). 

To tie in the present work with (2) we will need the following remarks. Let $R = P \otimes_K P'$, and let $S$ be the natural image in $R$ of $Q \otimes_K P'$, where the $'$ denotes reciprocal ring. For an $R$-module $W$, Hochschild (5) defines $H^1(P, Q; W)$ to be $\mathrm{Ext}^1_{(R,S)}(P, W)$, $P$ being considered naturally as an $R$-module. If $K = Q$, $H^1(P, Q; W) = H^1(P, W)$, the right-hand group being taken in the sense of cohomology of $K$-algebras (1). If $M$ and $N$ are $P$-modules, $\mathrm{Hom}_K(M, N)$ is given the structure of an $R$-module, and it is proved that there exists a natural isomorphism

$H^1(P, Q; \mathrm{Hom}_K(M, N)) \cong \mathrm{Ext}^1_{(P,Q)}(M, N)$

which is readily seen to be a $K$-isomorphism (5).

We define $D(P, Q) = \{x \in K \mid x \cdot H^1(P, Q; W) = 0$ for all $R$-modules $W\}$. In other words, $D(P, Q) = D_{(R,S)}(P)$. As a consequence of the above isomorphism we have


LEMMA 4. $D(P, Q) \subseteq D(M) \cap C(M)$ for any $P$-module $M$.


We remark finally that it is natural to define

$D^i(M) = \{x \in K \mid x \cdot \mathrm{Ext}^i(M, N) = 0$ for all $P$-modules $N\}$,

and to define $C^i(M)$ and $D^i(P, Q)$ similarly ($i = 1, 2, \ldots$). Then the reduction theorem (5) gives $D^1(M) \subseteq D^2(M) \subseteq \cdots$, and similar inclusions for the others. Moreover, Lemmas 3 and 4 hold for arbitrary $D^i$, not just for $D = D^1$. The applications in this paper are restricted to the case $i = 1$.

Results equivalent to Lemmas 1-4 were established in (3), but in a form 
not so convenient for our present purposes as the above. 


2. Orders and principal modules. In this section, $\mathfrak{o}$ will denote an integral domain, and $\mathfrak{O}$ will denote an $\mathfrak{o}$-order, that is, an $\mathfrak{o}$-algebra which is regular as an $\mathfrak{o}$-module. Here an $\mathfrak{o}$-module is called regular if it is finitely generated and torsion free. It will be convenient to refer to an $\mathfrak{o}$-regular $\mathfrak{O}$-module as an $\mathfrak{O}$-representation module. We shall in particular determine the $\mathfrak{O}$-representation modules $M$ such that $D(M) \neq 0$ or $C(M) \neq 0$, where

$D(M) = D_{(\mathfrak{O},\mathfrak{o})}(M)$ and $C(M) = C_{(\mathfrak{O},\mathfrak{o})}(M)$

as defined in § 1.







First we introduce some notations useful here and in the later sections. If $L$ is an integral domain containing $\mathfrak{o}$ as subdomain, we shall refer to the $L$-order

$\mathfrak{O}_L = \mathfrak{O} \otimes_{\mathfrak{o}} L$

as the $L$-hull of $\mathfrak{O}$. The $L$-hull of an $\mathfrak{O}$-module $M$ is defined to be the $\mathfrak{O}_L$-module

$M_L = M \otimes_{\mathfrak{O}} \mathfrak{O}_L = M \otimes_{\mathfrak{o}} L$.

If $k$ is the quotient field of $\mathfrak{o}$, $k$-hulls are referred to as rational hulls. Two $\mathfrak{O}$-modules $M$ and $N$ will be called rationally equivalent if their rational hulls are isomorphic. We shall also say that $M$ is rationally equivalent to an $A$-module $V$, $A = \mathfrak{O}_k$, if the rational hull of $M$ is isomorphic to $V$.

The $L$-hull $M_L$ of an $\mathfrak{O}$-representation module $M$ is an $\mathfrak{O}_L$-representation module, and, moreover, the natural homomorphisms $M \to M_L$ and

$M \otimes_{\mathfrak{o}} \mathfrak{O} \to [M \otimes_{\mathfrak{o}} \mathfrak{O}]_L = M_L \otimes_L \mathfrak{O}_L$

are $\mathfrak{O}$-monomorphisms. Hence the following lemma is an almost obvious consequence of Lemma 2.


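LEMMA 5. If $L$ is a ring of quotients of $\mathfrak{o}$ and $M$ is an $\mathfrak{O}$-representation module, then

$D_{(\mathfrak{O}_L, L)}(M_L) = L \cdot D_{(\mathfrak{O},\mathfrak{o})}(M)$.

Similarly

$C_{(\mathfrak{O}_L, L)}(M_L) = L \cdot C_{(\mathfrak{O},\mathfrak{o})}(M)$.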


An $\mathfrak{O}$-representation module $M$ will be called a principal $\mathfrak{O}$-module if it is rationally equivalent to a direct sum of right ideal components of the rational hull $A$ of $\mathfrak{O}$. On the other hand, $M$ will be called coprincipal if it is rationally equivalent to a direct sum of $A$-module components of the $A$-module

$\mathrm{Hom}_k(A, k)$.


THEOREM 1. An $\mathfrak{O}$-representation module $M$ is principal (coprincipal) if and
only if $D(M) \neq 0$ ($C(M) \neq 0$).


Proof. According to Lemma 5, $D(M) \neq 0$ is equivalent to $D_{(A,k)}(M_k) = k$,
which in turn is equivalent by Lemma 1 to the $(A, k)$-projectiveness of $M_k$.
But $(A, k)$-projectiveness coincides with $A$-projectiveness since $k$ is a field,
and it is well known that the $A$-projective modules are isomorphic with
direct sums of right ideal components of $A$.


COROLLARY 1. $D(M) \neq 0$ ($C(M) \neq 0$) for every $\mathfrak{O}$-representation module $M$
if and only if the rational hull $A$ of $\mathfrak{O}$ is semi-simple.













Proof. Every $A$-representation module is the rational hull of some $\mathfrak{O}$-representation
module. Hence by Theorem 1, $D(M) \neq 0$ for every $\mathfrak{O}$-representation
module $M$ if and only if every $A$-representation module is isomorphic with a
direct sum of right ideal components of $A$. The latter condition is equivalent
to the semi-simplicity of $A$.

The ideal $I(\mathfrak{O})$ defined in (2) coincides with the ideal $D(\mathfrak{O}, \mathfrak{o})$, as we see
from the last part of § 1. The following theorem was proved more directly
in (2).


THEOREM 2. A necessary and sufficient condition for $\mathfrak{O}$ to have separable
rational hull is that $I(\mathfrak{O})$ be non-zero.


Proof. By a theorem of Hochschild, $A = \mathfrak{O}_k$ is separable if and only if
$H^1(A, W) = 0$ for all $A \otimes_k A'$-modules $W$. But $H^1(A, W) = H^1(A, k; W)$,
so $A$ is separable if and only if $1 \in D(A, k)$. Using Lemma 2 we readily obtain
that $D(A, k) = k \cdot D(\mathfrak{O}, \mathfrak{o}) = k \cdot I(\mathfrak{O})$. Hence $1 \in D(A, k)$ if and only if
$I(\mathfrak{O}) \neq 0$.

According to Lemma 4, $I(\mathfrak{O}) \subseteq D(M)$ for every $\mathfrak{O}$-module $M$. Hence it is
a consequence of Theorem 2 that for separable $A$, $\bigcap D(M) \neq 0$, where the
intersection extends over all $\mathfrak{O}$-modules $M$ ($\mathfrak{o}$-regular or not). The result of
(4) implies the existence of non-separable but semi-simple $A$ such that for
every $A$-representation module $V$, $\bigcap D(M) \neq 0$, where the intersection
extends over all $\mathfrak{O}$-representation modules rationally equivalent to $V$. In
fact, the Theorem of (4) implies the existence of such $A$ for which every
$A$-representation module has finite class number. It is of interest that $\bigcap D(M)$
may be zero when the intersection extends over the $\mathfrak{O}$-representation modules
$M$ rationally equivalent to a given right ideal component of $A$. For example,
if $\mathfrak{O}$ is taken to be the $Z$-order of all matrices


$$X = \begin{pmatrix} x & y \\ 0 & z \end{pmatrix}$$


with $x$, $y$, and $z$ in the ring $Z$ of rational integers, the $\mathfrak{O}$-representation module
$M_n$ corresponding to the matrix representation mapping $X$ onto


$$\begin{pmatrix} x & ny \\ 0 & z \end{pmatrix}$$


for fixed rational integer $n$ has $D(M_n) = nZ$, as is readily seen using Lemma 2.
Hence $\bigcap D(M_n) = 0$. Further, every $M_n$ ($n = 1, 2, \ldots$) is rationally equivalent
to the same indecomposable right ideal component of the rational hull $A$
of $\mathfrak{O}$. Of course $A$ is not semi-simple.
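The example can be checked numerically. The sketch below is illustrative only and rests on an assumption: the two matrix displays are not legible in the text, so the order is taken to be the upper triangular 2 by 2 integer matrices and the representation attached to $M_n$ is taken to multiply the off-diagonal entry by $n$. Under that assumption the code verifies that the map is multiplicative and that it is conjugate over the rationals to the identity representation by $T = \mathrm{diag}(1, n)$, which is what rational equivalence of all the $M_n$ amounts to. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def mat_mul(a, b):
    """Multiply two 2x2 matrices given as tuples of rows."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def order_element(x, y, z):
    """Assumed shape of an element X of the order: upper triangular over Z."""
    return ((x, y), (0, z))


def rho(n, X):
    """Assumed representation attached to M_n: scale the off-diagonal entry by n."""
    (x, y), (_, z) = X
    return ((x, n * y), (0, z))


def conjugate_by_diag(n, X):
    """Compute T^{-1} X T over the rationals for T = diag(1, n)."""
    t_inv = ((Fraction(1), Fraction(0)), (Fraction(0), Fraction(1, n)))
    t = ((1, 0), (0, n))
    return mat_mul(mat_mul(t_inv, X), t)


def check(n, bound=2):
    """True if rho_n is multiplicative and equals T^{-1} X T on every sample."""
    samples = [
        order_element(*entries)
        for entries in product(range(-bound, bound + 1), repeat=3)
    ]
    multiplicative = all(
        rho(n, mat_mul(X, Y)) == mat_mul(rho(n, X), rho(n, Y))
        for X in samples
        for Y in samples
    )
    conjugate = all(conjugate_by_diag(n, X) == rho(n, X) for X in samples)
    return multiplicative and conjugate
```

Since $T$ is invertible over the rationals but not over $Z$ when $n > 1$, the check is consistent with the $M_n$ being rationally equivalent without being isomorphic as modules over the order.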

The following additional remarks may be in order here. We noted at the
end of § 1 that the ideal $I(\mathfrak{O}) = D(\mathfrak{O}, \mathfrak{o})$ is merely the first member of an
ascending chain of ideals of $\mathfrak{o}$: $I(\mathfrak{O}) = I^1(\mathfrak{O}) \subseteq I^2(\mathfrak{O}) \subseteq \ldots$, where $I^n(\mathfrak{O}) =
D^n(\mathfrak{O}, \mathfrak{o})$. If $\mathfrak{o}$ satisfies the ascending chain condition, there is a first $n$ such
that $I^n(\mathfrak{O}) = I^{n+1}(\mathfrak{O}) = \ldots$, and it may be of interest to ask what is the
significance of this $n$ for separable $A$. It follows from a result of (3) that
$n = 1$ for a group ring $\mathfrak{O}$. Similar remarks apply to $D(M)$ and $C(M)$. It may
also be of interest to look in the set $T$ of $\mathfrak{O}$-representation modules rationally
equivalent to a given right ideal component of $A$ for those such that $D(M)$
is maximal for $M \in T$. When can we find $D(M) = \mathfrak{o}$, that is, $M$ $(\mathfrak{O}, \mathfrak{o})$-projective?
In this regard see Theorem 11 following.


3. The local case. In this and the next two sections we assume that
$\mathfrak{o}$ is a local domain, that is, that $\mathfrak{o}$ is a principal ideal domain in which the
non-units constitute the unique prime ideal $\mathfrak{p} = \pi\mathfrak{o}$. As always, $\mathfrak{O}$ denotes
an $\mathfrak{o}$-order.

For an integer $s > 0$, $\mathfrak{o}^{(s)}$ will denote $\mathfrak{o}/\pi^s \mathfrak{o}$, and $\mathfrak{O}^{(s)}$ will denote the
$\mathfrak{o}^{(s)}$-algebra $\mathfrak{O}/\pi^s \mathfrak{O}$. It is the main purpose of this and the next section to
study relations between the representation theory of $\mathfrak{O}$, $\mathfrak{O}^{(s)}$, and the rational
hull of $\mathfrak{O}$. The results are largely generalizations of results obtained by
Maranda (6; 7) for the group case, and extensions of some results of Reiner
(8) also arise.

The $\mathfrak{o}$-module $M/\pi^s M$ will be denoted by $M^{(s)}$, and can be considered as
an $\mathfrak{o}^{(s)}$-module. If $M$ is an $\mathfrak{O}$-module, so is $M^{(s)}$, and $M^{(s)}$ may be considered
as an $\mathfrak{O}^{(s)}$-module. For $f \in \mathrm{Hom}^{\dagger}(M, N)$, $f^{(s)}$ will denote$^1$ the natural image
in $\mathrm{Hom}^{\dagger}(M^{(s)}, N^{(s)})$; if $f$ is an $\mathfrak{O}$-homomorphism, so is $f^{(s)}$. We shall say
that $\mathfrak{O}$-modules $M$ and $N$ are isomorphic modulo $\mathfrak{p}^s$ if $M^{(s)}$ and $N^{(s)}$ are
isomorphic as $\mathfrak{O}$-modules or, what is the same thing, as $\mathfrak{O}^{(s)}$-modules.




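As in § 1, we may use the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolution of an
$\mathfrak{O}$-module $M$ to compute $\mathrm{Ext}^1(M, N)$. Taking into account the natural
$\mathfrak{O}$-isomorphisms

$$[M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)} = M^{(s)} \otimes_{\mathfrak{o}} \mathfrak{O} = M^{(s)} \otimes_{\mathfrak{o}^{(s)}} \mathfrak{O}^{(s)},$$

we see that if

$$\cdots \xrightarrow{\chi_1} K_1 \otimes_{\mathfrak{o}} \mathfrak{O} \xrightarrow{\chi_0} M \otimes_{\mathfrak{o}} \mathfrak{O} \to 0$$

is the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolution of $M$, then

$$\cdots \xrightarrow{\chi_1^{(s)}} [K_1 \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)} \xrightarrow{\chi_0^{(s)}} [M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)} \to 0$$

may be identified with the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolution of $M^{(s)}$
considered as an $\mathfrak{O}$-module, or with the standard $(\mathfrak{O}^{(s)}, \mathfrak{o}^{(s)})$-projective
resolution of $M^{(s)}$ considered as an $\mathfrak{O}^{(s)}$-module. We shall denote by $X^{(s)}$ the
left $\mathfrak{O}$-complex determined by this resolution, and by $X$ the left $\mathfrak{O}$-complex
determined by the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolution of $M$. Then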


$^1$ A dagger on the homomorphism ($\mathrm{Hom}^{\dagger}$) indicates that the homomorphism is taken with
respect to the domain $\mathfrak{o}$.










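$$\mathrm{Ext}^1_{(\mathfrak{O},\mathfrak{o})}(M^{(s)}, N^{(s)})$$

is the 1-dimensional cohomology group of $\mathrm{Hom}^{\ddagger}(X^{(s)}, N^{(s)})$, and

$$\mathrm{Ext}^1_{(\mathfrak{O}^{(s)},\mathfrak{o}^{(s)})}(M^{(s)}, N^{(s)})$$

is the 1-dimensional cohomology group of

$$\mathrm{Hom}_{\mathfrak{O}^{(s)}}(X^{(s)}, N^{(s)}),$$

which is merely $\mathrm{Hom}^{\ddagger}(X^{(s)}, N^{(s)})$ considered as an $\mathfrak{o}^{(s)}$-complex.$^2$ Thus

$$\mathrm{Ext}^1_{(\mathfrak{O}^{(s)},\mathfrak{o}^{(s)})}(M^{(s)}, N^{(s)})$$

is simply

$$\mathrm{Ext}^1_{(\mathfrak{O},\mathfrak{o})}(M^{(s)}, N^{(s)})$$

considered as an $\mathfrak{o}^{(s)}$-module. It follows that

$$D_{(\mathfrak{O}^{(s)},\mathfrak{o}^{(s)})}(M^{(s)}) = \bigl(D_{(\mathfrak{O},\mathfrak{o})}(M^{(s)}) + \mathfrak{p}^s\bigr)/\mathfrak{p}^s,$$

a fact that is also readily seen from Lemma 2.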
We now prove the following generalization of Maranda’s Theorem 2 of (6). 


THEOREM 3. Let $M$ and $N$ be $\mathfrak{O}$-representation modules, and assume that

$$\pi^s \cdot \mathrm{Ext}^1_{(\mathfrak{O},\mathfrak{o})}(M, N) = 0.$$

Then $M$ and $N$ are isomorphic if and only if they are isomorphic modulo $\mathfrak{p}^{s+1}$.
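Proof. An $\mathfrak{O}$-isomorphism $M^{(s+1)} \cong N^{(s+1)}$ is induced by an $\mathfrak{o}$-isomorphism
$\beta : M \cong N$, for $M$ and $N$ have free $\mathfrak{o}$-module bases since $\mathfrak{o}$ is a principal
ideal domain. Then for $u \in M$, $w \in \mathfrak{O}$, $\beta(uw) - \beta(u)w \in \pi^{s+1} N$. Let

$$* : \mathrm{Hom}^{\dagger}(M, N) \cong \mathrm{Hom}^{\ddagger}(M \otimes_{\mathfrak{o}} \mathfrak{O}, N)$$

be the natural isomorphism; then this identity means that $\chi_0 \beta^* \equiv 0 \pmod{\pi^{s+1}}$.
Hence there exists

$$f \in \mathrm{Hom}^{\dagger}(K_1 \otimes_{\mathfrak{o}} \mathfrak{O}, N)$$

such that

$$\chi_0 \beta^* = \pi^{s+1} \cdot f.$$

Since $\chi_0$ and $\beta^*$ are $\mathfrak{O}$-homomorphisms, so is $f$. And

$$0 = \chi_1 \chi_0 \beta^* = \pi^{s+1} \cdot \chi_1 f,$$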





$^2$ A double dagger on the homomorphism ($\mathrm{Hom}^{\ddagger}$) indicates that the homomorphism is taken
with respect to the order $\mathfrak{O}$.




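so $\chi_1 f = 0$ and $f$ is a 1-cocycle. The assumption of the theorem therefore
implies that $\pi^s f$ is a coboundary, and hence that there exists $g \in \mathrm{Hom}^{\dagger}(M, N)$
such that $\pi^s \cdot f = \chi_0 g^*$. Let $\alpha = \beta - \pi \cdot g$; then $\chi_0 \alpha^* = \chi_0 \beta^* - \pi \chi_0 g^* =
\pi^{s+1} \cdot f - \pi(\pi^s f) = 0$, which implies that $\alpha$ is an element of $\mathrm{Hom}^{\ddagger}(M, N)$. Since $\beta$
induces an isomorphism $M^{(s+1)} \cong N^{(s+1)}$, $\det \beta \notin \mathfrak{p}$. But $\det \alpha \equiv \det \beta \pmod{\mathfrak{p}}$,
so $\det \alpha \notin \mathfrak{p}$. Hence $\det \alpha$ is a unit in $\mathfrak{o}$ and $\alpha : M \cong N$.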

Since the converse is immediate, the proof of Theorem 3 is complete. 


We now define the depth (codepth) of an $\mathfrak{O}$-module $M$ to be $s$ if $D(M) = \mathfrak{p}^s$
($C(M) = \mathfrak{p}^s$) and $\infty$ if $D(M) = 0$ ($C(M) = 0$). Thus by Theorem 1, the
principal (coprincipal) $\mathfrak{O}$-modules are the $\mathfrak{O}$-representation modules of finite
depth (codepth).

An immediate consequence of this definition of depth and Theorem 3 is 


COROLLARY 1. A principal (coprincipal) $\mathfrak{O}$-module $M$ of depth $s$ (codepth $s$) is
isomorphic with an $\mathfrak{O}$-representation module $N$ if and only if $M$ and $N$ are
isomorphic modulo $\mathfrak{p}^{s+1}$.


Simple examples of the sort given at the end of § 2 show that the number
of non-isomorphic $\mathfrak{O}$-representation modules rationally equivalent to a given
$\mathfrak{O}_k$-representation module $V$ may be infinite, even if $V$ is an indecomposable
right ideal component of $\mathfrak{O}_k$, and $\mathfrak{o}$ has finite residue class rings. But we do
have


COROLLARY 2. If $\mathfrak{o}^{(s+1)}$ is finite, the number of non-isomorphic $\mathfrak{O}$-representation
modules of depth (codepth) $s$ and given rank is finite.


Proof. Corollary 1 implies that the isomorphism class of $M$ is determined
by the isomorphism class of $M^{(s+1)}$. But $\mathfrak{O}^{(s+1)}$ and $M^{(s+1)}$ have only finitely
many elements as finitely generated modules over the finite ring $\mathfrak{o}^{(s+1)}$.
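The count behind this proof can be made explicit. The following is a sketch under assumptions not stated in the text: $w_1, \ldots, w_r$ is an $\mathfrak{o}$-basis of the order (one exists because $\mathfrak{o}$ is a principal ideal domain), $n$ is the rank of $M$, and $q$ is the number of elements of $\mathfrak{o}^{(s+1)}$. The bound is crude and is meant only to show why finiteness follows.

```latex
% M is o-free of rank n, so M^{(s+1)} is free of rank n over o^{(s+1)}.
% A module structure over O^{(s+1)} on it is fixed by the n-by-n matrices
% over o^{(s+1)} that represent w_1, ..., w_r.
\[
  \#\bigl\{\mathfrak{O}^{(s+1)}\text{-module structures on }
  (\mathfrak{o}^{(s+1)})^{n}\bigr\}
  \;\le\; \bigl|\,M_n(\mathfrak{o}^{(s+1)})\,\bigr|^{\,r}
  \;=\; q^{\,r n^{2}} .
\]
```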

If the rational hull of $\mathfrak{O}$ is separable, $I(\mathfrak{O}) = D(\mathfrak{O}, \mathfrak{o})$ is non-zero by
Theorem 2, hence $I(\mathfrak{O}) = \mathfrak{p}^t$. We call $t$ the depth of $\mathfrak{O}$. By Lemma 4,
$I(\mathfrak{O}) \subseteq D(M)$ for every $\mathfrak{O}$-module $M$, hence $0 \le \text{depth } M \le t$ for every
$\mathfrak{O}$-module $M$. Hence in this case Corollary 2 implies


COROLLARY 3. If $\mathfrak{O}$ has separable rational hull and $\mathfrak{o}$ has finite residue class
rings, then the number of non-isomorphic $\mathfrak{O}$-representation modules of given rank
is finite.


We now show that depth is preserved under the transition to the complete
case. Let $k^*$ denote the completion of the quotient field $k$ of $\mathfrak{o}$ with respect
to the valuation determined by $\mathfrak{p}$. Let $\mathfrak{o}^*$ be the valuation ring of $k^*$, and let
$\mathfrak{p}^* = \pi\mathfrak{o}^*$ be the valuation ideal. We denote by $\mathfrak{O}^*$ and $M^*$ the $\mathfrak{o}^*$-hulls of
$\mathfrak{O}$ and $M$ respectively.


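THEOREM 4. For an $\mathfrak{O}$-representation module $M$,

$$D_{(\mathfrak{O},\mathfrak{o})}(M) = D_{(\mathfrak{O}^*,\mathfrak{o}^*)}(M^*) \cap \mathfrak{o}$$

and

$$C_{(\mathfrak{O},\mathfrak{o})}(M) = C_{(\mathfrak{O}^*,\mathfrak{o}^*)}(M^*) \cap \mathfrak{o}.$$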


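Proof. We use Lemma 2, according to which $x \in \mathfrak{o}$ belongs to $D(M)$ if and
only if there exists an $\mathfrak{O}_k$-homomorphism

$$\beta : M_k \to [M \otimes_{\mathfrak{o}} \mathfrak{O}]_k$$

such that

(i) $\beta \pi_k = x \cdot I$, where

$$\pi_k : [M \otimes_{\mathfrak{o}} \mathfrak{O}]_k \to M_k$$

is induced by the natural homomorphism

$$\pi : M \otimes_{\mathfrak{o}} \mathfrak{O} \to M,$$

and $I$ is the identity map of $M_k$, and

(ii) $\beta(M) \subseteq M \otimes_{\mathfrak{o}} \mathfrak{O}$, where $M$ and $M \otimes_{\mathfrak{o}} \mathfrak{O}$ are identified with their
natural images in $M_k$ and $[M \otimes_{\mathfrak{o}} \mathfrak{O}]_k$ respectively.

Since $\mathfrak{o}$ is a principal ideal domain, there exist free $\mathfrak{o}$-module bases $u_1, \ldots, u_m$
and $v_1, \ldots, v_n$ of $M$ and $M \otimes_{\mathfrak{o}} \mathfrak{O}$ respectively. If we write $\beta(u_i) = \sum_j a_{ij} v_j$,
then for given $x \in \mathfrak{o}$, (i) may be expressed as a system of linear equations
in the unknowns $a_{ij}$, with coefficients in $\mathfrak{o}$. The condition that $x$ be in $D(M^*)$
means that the system has a solution in $\mathfrak{o}^*$. But a solution exists in $\mathfrak{o}^*$ if and
only if one exists in $\mathfrak{o}$, that is, if and only if $x \in D(M)$, proving Theorem 4.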
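The last step, that a linear system over $\mathfrak{o}$ is solvable in $\mathfrak{o}^*$ exactly when it is solvable in $\mathfrak{o}$, is used without comment. One standard way to see it, offered here as a sketch and not as the author's argument, is by elementary divisors. The symbols $Y$, $b$, $P$, $Q$, $d_i$, $c$ below are introduced only for this illustration.

```latex
% Write the system as Y a = b with Y a matrix and b a column over the
% principal ideal domain o. Elementary divisor form: Y = P D Q with P, Q
% invertible over o and D = diag(d_1, ..., d_r, 0, ..., 0), d_i \neq 0.
\[
  Y a = b
  \iff D\,(Q a) = c, \qquad c := P^{-1} b \ \text{(entries in } \mathfrak{o}\text{)}.
\]
% Over either ring R = o or R = o^*, solvability therefore means
\[
  d_i \mid c_i \ \text{in } R \quad (1 \le i \le r),
  \qquad c_i = 0 \quad (i > r).
\]
% The two divisibility conditions agree because c_i / d_i lies in k and
\[
  \mathfrak{o}^{*} \cap k = \mathfrak{o}.
\]
```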

According to the definition of depth given above, Theorem 4 can be re- 
stated as 


COROLLARY 1. The depth of an $\mathfrak{O}$-representation module $M$ is equal to the
depth of its $\mathfrak{o}^*$-hull $M^*$.


An important consequence for our purposes is the following extension of
Corollary 1 to Theorem 1 of Maranda's paper (7).


COROLLARY 2. A principal (coprincipal) $\mathfrak{O}$-module $M$ is isomorphic with an
$\mathfrak{O}$-representation module $N$ if and only if $M^*$ and $N^*$ are isomorphic.


Proof. Suppose $M$ has depth $s$; then $D(M^*) = \pi^s \mathfrak{o}^*$ by Corollary 1. Now
$\mathfrak{o}^{(s+1)} = \mathfrak{o}^{*(s+1)}$, and, as $\mathfrak{o}^{(s+1)}$-algebras, $\mathfrak{O}^{(s+1)} = \mathfrak{O}^{*(s+1)}$. Further, as
$\mathfrak{O}^{(s+1)}$-modules, $M^{(s+1)} = M^{*(s+1)}$. The result now follows by Corollary 1 to
Theorem 3.

We remark that a similar application of Lemma 2 proves that $I(\mathfrak{O}) =
I(\mathfrak{O}^*) \cap \mathfrak{o}$, and hence that the depth of $\mathfrak{O}$ is equal to that of $\mathfrak{O}^*$.


4. The complete case. We retain the notation of § 3, and assume in
addition that $k$ is complete, that is, that $k = k^*$.




We need the following remark, which depends only on the fact that $\mathfrak{o}$ is
a principal ideal domain.


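LEMMA 6. Given $\mathfrak{O}$-representation modules $M$ and $N$, and $s > 0$, every
$\mathfrak{O}$-homomorphism

$$[M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)} \to N^{(s)}$$

is induced by an $\mathfrak{O}$-homomorphism $M \otimes_{\mathfrak{o}} \mathfrak{O} \to N$.


Proof. Denote by $*$ the natural isomorphism

$$\mathrm{Hom}^{\dagger}(U, V) \cong \mathrm{Hom}^{\ddagger}(U \otimes_{\mathfrak{o}} \mathfrak{O}, V)$$

for any two $\mathfrak{O}$-modules $U$ and $V$. If

$$g \in \mathrm{Hom}^{\ddagger}([M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)}, N^{(s)}),$$

then $g = h^*$ for some $h \in \mathrm{Hom}^{\dagger}(M^{(s)}, N^{(s)})$. Since $\mathfrak{o}$ is a principal ideal
domain, there exists $f \in \mathrm{Hom}^{\dagger}(M, N)$ inducing $h$. But then

$$f^* \in \mathrm{Hom}^{\ddagger}(M \otimes_{\mathfrak{o}} \mathfrak{O}, N)$$

induces $h^* = g$. Here we have used the natural identification of
$M^{(s)} \otimes_{\mathfrak{o}} \mathfrak{O}$ with $[M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(s)}$.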


A homological extension of a result of Reiner (8) is the following.

THEOREM 5. If $M$ and $N$ are $\mathfrak{O}$-representation modules, then
$\pi^s \cdot \mathrm{Ext}^1(M^{(s+1)}, N^{(s+1)}) = 0$
implies $\pi^s \cdot \mathrm{Ext}^1(M, N) = 0$.


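Proof. Suppose that $\pi^s \cdot \mathrm{Ext}^1(M^{(s+1)}, N^{(s+1)}) = 0$, and let $X$ be the left
$\mathfrak{O}$-complex determined by the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolution of $M$. If
$f : K_1 \otimes_{\mathfrak{o}} \mathfrak{O} \to N$ is a 1-cocycle for $\mathrm{Hom}^{\ddagger}(X, N)$, then $f^{(s+1)}$ is a 1-cocycle
for $\mathrm{Hom}^{\ddagger}(X^{(s+1)}, N^{(s+1)})$. (The notations $X$ and $X^{(s+1)}$ refer to the left
$\mathfrak{O}$-complexes obtained from the standard $(\mathfrak{O}, \mathfrak{o})$-projective resolutions of $M$ and
$M^{(s+1)}$ respectively, cf. the third paragraph of § 3.) Hence $\pi^s \cdot f^{(s+1)}$ is a
coboundary, which means, using Lemma 6, that there exists an $\mathfrak{O}$-homomorphism
$g_0 : M \otimes_{\mathfrak{o}} \mathfrak{O} \to N$ such that

$$\pi^s \cdot f \equiv \chi_0 g_0 \pmod{\pi^{s+1}},$$

that is, there exists an $\mathfrak{o}$-homomorphism $f_1 : K_1 \otimes_{\mathfrak{o}} \mathfrak{O} \to N$ such that

$$\pi^s \cdot f = \chi_0 g_0 + \pi^{s+1} \cdot f_1.$$

Since $f$ and $\chi_0 g_0$ are $\mathfrak{O}$-homomorphisms, so is $f_1$. Since $f$ is a cocycle,

$$\pi^{s+1} \cdot \chi_1 f_1 = \pi^s \cdot \chi_1 f - \chi_1 \chi_0 g_0 = 0,$$

so $f_1$ is a cocycle. Hence, repetition of the above produces an $\mathfrak{O}$-homomorphism
$g_1 : M \otimes_{\mathfrak{o}} \mathfrak{O} \to N$ and an $\mathfrak{o}$-homomorphism $f_2 : K_1 \otimes_{\mathfrak{o}} \mathfrak{O} \to N$ such that
$\pi^s \cdot f_1 = \chi_0 g_1 + \pi^{s+1} \cdot f_2$. Then

$$\pi^s \cdot f = \chi_0 (g_0 + \pi \cdot g_1) + \pi^{s+2} \cdot f_2$$

and again we see that $f_2$ is a cocycle. Continuing in this way we obtain
$\mathfrak{O}$-homomorphisms

$$g_i : M \otimes_{\mathfrak{o}} \mathfrak{O} \to N \qquad (i = 0, 1, 2, \ldots)$$

such that

$$\pi^s \cdot f \equiv \chi_0 (g_0 + \pi \cdot g_1 + \ldots + \pi^r \cdot g_r) \pmod{\mathfrak{p}^{s+r+1}}.$$

Since $k$ is complete, we may define $g = g_0 + \pi \cdot g_1 + \ldots + \pi^r \cdot g_r + \ldots$, and
conclude that $\pi^s \cdot f = \chi_0 g$ is a coboundary. Hence $\pi^s \cdot \mathrm{Ext}^1(M, N) = 0$, proving
Theorem 5.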
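The phrase "continuing in this way" hides a one-line induction. Written out as a sketch, with $f_0 = f$ and with $g_i$, $f_{r+1}$ as produced in the proof:

```latex
% Inductive hypothesis for r >= 0 (f_{r+1} is a 1-cocycle):
\[
  \pi^{s} f \;=\; \chi_0\Bigl(\sum_{i=0}^{r} \pi^{i} g_i\Bigr) + \pi^{s+r+1} f_{r+1}.
\]
% The first step of the proof, applied to the cocycle f_{r+1}:
\[
  \pi^{s} f_{r+1} \;=\; \chi_0\, g_{r+1} + \pi^{s+1} f_{r+2}.
\]
% Multiply the second identity by \pi^{r+1} and substitute into the first:
\[
  \pi^{s} f \;=\; \chi_0\Bigl(\sum_{i=0}^{r+1} \pi^{i} g_i\Bigr) + \pi^{s+r+2} f_{r+2}.
\]
```

The remainder term lies in $\pi^{s+r+2} N$, so the partial sums of the $\pi^i g_i$ converge in the $\mathfrak{p}$-adic topology, which is where completeness is used.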

COROLLARY 1. An $\mathfrak{O}$-representation module $M$ has depth (codepth) $s$ if and
only if $M^{(s+1)}$ has depth (codepth) $s$ (as an $\mathfrak{O}$-module).

Proof. By Theorem 5, if $M^{(s+1)}$ has depth $s$, $M$ has depth $\le s$. But it follows
at once from Lemma 2 and the existence of the natural isomorphism

$$[M \otimes_{\mathfrak{o}} \mathfrak{O}]^{(t)} \cong M^{(t)} \otimes_{\mathfrak{o}} \mathfrak{O}$$

that the depth of $M^{(t)}$ is $\le$ the depth of $M$ for any $t$.

Since by Lemma 1 the $\mathfrak{O}$-modules of depth 0 (codepth 0) are precisely
the $(\mathfrak{O}, \mathfrak{o})$-projective (injective) ones, the case $s = 0$ of Corollary 1 gives


COROLLARY 2. An $\mathfrak{O}$-representation module $M$ is $(\mathfrak{O}, \mathfrak{o})$-projective (injective)
if and only if $M/\pi M$ is $(\mathfrak{O}, \mathfrak{o})$-projective (injective).


This is a slight improvement of a result of Reiner (8) since the "only if"
part does not require special hypotheses. Note that $(\mathfrak{O}, \mathfrak{o})$-projectiveness and
$(\mathfrak{O}/\pi\mathfrak{O}, \mathfrak{o}/\pi\mathfrak{o})$-projectiveness coincide for an $\mathfrak{O}/\pi\mathfrak{O}$-module.

We observe, without including the details, that essentially the same argument
used to prove Theorem 5 and Corollary 1 proves


THEOREM 6. $\mathfrak{O}$ has depth $t$ if and only if $D(\mathfrak{O}^{(t+1)}, \mathfrak{o}) = \mathfrak{p}^t$ ($\mathfrak{O}^{(t+1)}$ being
considered as an $\mathfrak{o}$-algebra).


The case $t = 0$, combined with Hochschild's characterization of separable
algebras, gives the

COROLLARY. $\mathfrak{O}$ has depth 0 if and only if $\mathfrak{O}/\pi\mathfrak{O}$ is a separable $\mathfrak{o}/\pi\mathfrak{o}$-algebra.

For application in § 5 we need

THEOREM 7 (Brauer). If $H$ is an $(\mathfrak{O}, \mathfrak{o})$-projective (injective) $\mathfrak{O}$-representation
module, and if $U$ is an $\mathfrak{O}$-module direct summand of $H/\pi H$, then there
exists an $\mathfrak{O}$-module direct summand $M$ of $H$ such that $M/\pi M \cong U$.




We will derive this theorem here as a corollary to an extension of a variant
of Maranda's Theorem 3 of (6).

If $M$ is a primitive $\mathfrak{o}$-submodule of an $\mathfrak{O}$-representation module $H$ (that
is, an $\mathfrak{o}$-module direct summand), we may identify $M^{(s)} = M/\pi^s M$ with the
$\mathfrak{o}$-submodule $(M + \pi^s H)/\pi^s H$ of $H^{(s)}$, and then we may identify $[H/M]^{(s)}$
with $H^{(s)}/M^{(s)}$. Then $M^{(s)}$ is an $\mathfrak{O}$-submodule of $H^{(s)}$ if and only if $M\mathfrak{O} \subseteq
M + \pi^s H$, and every $\mathfrak{o}^{(s)}$-primitive $\mathfrak{O}$-submodule of $H^{(s)}$ is obtained in this way
from a primitive $\mathfrak{o}$-submodule of $H$.


THEOREM 8. Let $M$ be a primitive $\mathfrak{o}$-submodule of an $\mathfrak{O}$-representation module
$H$, such that $M\mathfrak{O} \subseteq M + \pi^{s+1} H$ and $M^{(s+1)}$ has depth $s$ as an $\mathfrak{O}$-module.
Then there exists an $\mathfrak{O}$-submodule $M^*$ of $H$ of depth $s$, and primitive as an
$\mathfrak{o}$-submodule, such that $M^{*(s+1)} \cong M^{(s+1)}$.

Proof. We construct a sequence $M_0, M_1, \ldots, M_i, \ldots$ of primitive $\mathfrak{o}$-submodules
of $H$ such that $M_i \mathfrak{O} \subseteq M_i + \pi^{s_i} H$, $s_0 = s + 1$, $s_{i+1} = s_i + 1$, and
$M_{i+1}^{(s+1)} \cong M_i^{(s+1)}$. The existence of an $\mathfrak{o}$-primitive $\mathfrak{O}$-submodule $M^*$ of $H$
with $M^{*(s+1)} \cong M^{(s+1)}$ then follows by the completeness of $k$. Since $M^{(s+1)}$
has depth $s$, Corollary 1 to Theorem 5 implies that $M^*$ also has depth $s$.

The $M_i$ are constructed inductively. Let $M_0 = M$, and assume that $M_i$ has
been constructed. Since $M_i$ is primitive, there exists an $\mathfrak{o}$-submodule $N$ of
$H$ such that $H = M_i \oplus N$. Then

$$M_i \mathfrak{O} \subseteq M_i + \pi^{s_i} H = M_i \oplus \pi^{s_i} N.$$

Thus, for $u \in M_i$ and $w \in \mathfrak{O}$, there exist unique elements $\sigma(u, w) \in M_i$ and
$\tau(u, w) \in N$ such that

$$uw = \sigma(u, w) + \pi^{s_i} \cdot \tau(u, w).$$


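We denote by $T$ the $\mathfrak{O} \otimes_{\mathfrak{o}} \mathfrak{O}'$-module

$$\mathrm{Hom}_{\mathfrak{o}}(M_i^{(s+1)}, (H/M_i)^{(s+1)}),$$

and by $\tau^{\dagger}$ the element of $\mathrm{Hom}^{\dagger}(\mathfrak{O}, T)$ such that $\tau^{\dagger}(w)\{\bar{u}\}$ is the residue class
modulo $\pi^{s+1} [H/M_i]$ of $M_i + \tau(u, w) \in H/M_i$, where $\bar{u}$ is the residue class
modulo $\pi^{s+1} M_i$ of $u \in M_i$. From the associative law $u(\xi\eta) = (u\xi)\eta$ we get
the identity

$$\tau(u, \xi\eta) = \tau(\sigma(u, \xi), \eta) + \tau(u, \xi)\eta,$$

which means that $\tau^{\dagger}$ is a 1-cocycle for the complex with homogeneous components
$C^n(\mathfrak{O}, \mathfrak{o}; T)$ described in (6, § 3). Since the 1-dimensional cohomology
group of this complex is $\mathrm{Ext}^1(M_i^{(s+1)}, [H/M_i]^{(s+1)})$, and since $M_i^{(s+1)} \cong M^{(s+1)}$
has depth $s$, it follows that $\pi^s \cdot \tau^{\dagger}$ is a coboundary. Hence there exists
$g \in \mathrm{Hom}_{\mathfrak{o}}(H, N)$ such that for $u \in M_i$, $w \in \mathfrak{O}$,

$$\pi^s \cdot \tau(u, w) = g(uw) - g(u)w + \pi^{s+1} \cdot \mu(u, w),$$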

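with $\mu(u, w) \in H$. Let $M_{i+1} = \{u + \pi^{s_i - s} \cdot g(u) \mid u \in M_i\}$; then $M_{i+1}$ is a
primitive $\mathfrak{o}$-submodule of $H$ since $H = M_{i+1} \oplus N$. Since $2s_i - s \ge s_i + 1 = s_{i+1}$,
we have for $w \in \mathfrak{O}$ and $u \in M_i$ that

$$\begin{aligned}
[u + \pi^{s_i - s} \cdot g(u)]w &= uw + \pi^{s_i - s} g(u)w \\
&= \sigma(u, w) + \pi^{s_i} \cdot \tau(u, w) + \pi^{s_i - s}\{g(uw) - \pi^s \cdot \tau(u, w) + \pi^{s+1} \cdot \mu(u, w)\} \\
&= \sigma(u, w) + \pi^{s_i - s} \cdot g(\sigma(u, w)) + \pi^{2s_i - s} \cdot g(\tau(u, w)) + \pi^{s_i + 1} \mu(u, w) \\
&\in M_{i+1} + \pi^{s_{i+1}} H.
\end{aligned}$$

Hence $M_{i+1}\mathfrak{O} \subseteq M_{i+1} + \pi^{s_{i+1}} H$.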
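Now we define an $\mathfrak{o}$-module automorphism $\phi$ of $H$ by mapping $u \in M_i$
onto $u + \pi^{s_i - s} g(u)$, and $v \in N$ onto $v$, so that in particular $\phi(M_i) = M_{i+1}$.
For $u \in M_i$ and $w \in \mathfrak{O}$,

$$\begin{aligned}
\phi(uw) = \phi(\sigma(u, w) + \pi^{s_i} \cdot \tau(u, w)) &= \sigma(u, w) + \pi^{s_i - s} g(\sigma(u, w)) + \pi^{s_i} \cdot \tau(u, w) \\
&\equiv \sigma(u, w) + \pi^{s_i - s} \cdot g(\sigma(u, w)) \equiv \phi(u)w \pmod{\pi^{s_i}}.
\end{aligned}$$

We may conclude that $M_{i+1}^{(s+1)} \cong M_i^{(s+1)}$, which means that the inductive
construction of the $M_i$ is complete.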

We can deduce Theorem 7 from Theorem 8 and Corollary 2 to Theorem 5
as follows: Since $H$ is $(\mathfrak{O}, \mathfrak{o})$-projective, so is $H/\pi H$ by Corollary 2 to Theorem
5, hence so is the $\mathfrak{O}$-module direct summand $U$. Theorem 8 (with $s = 0$)
therefore implies the existence of an $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-submodule $M$ such
that $M/\pi M \cong U$.


Another application of Theorem 8 is the following 


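COROLLARY. Let $U$ be an $\mathfrak{O}^{(s+1)}$-module, that is, an $\mathfrak{O}$-module such that
$\pi^{s+1} U = 0$. If $U$ has depth (codepth) $s$ as an $\mathfrak{O}$-module, and is finitely generated
and projective as an $\mathfrak{o}^{(s+1)}$-module, there exists an $\mathfrak{O}$-representation module $M^*$
of depth (codepth) $s$ such that $M^{*(s+1)} \cong U$.


Proof. Since $U$ is a projective and finitely generated $\mathfrak{o}^{(s+1)}$-module, it is
an $\mathfrak{o}^{(s+1)}$-module direct summand of a finitely generated free $\mathfrak{o}^{(s+1)}$-module
$V$. Now there exists a regular $\mathfrak{o}$-module $N$ such that $N^{(s+1)} \cong V$. Moreover,
there exists an $\mathfrak{O}$-isomorphism $\mathrm{Hom}^{\dagger}(\mathfrak{O}, V) \cong [\mathrm{Hom}^{\dagger}(\mathfrak{O}, N)]^{(s+1)}$, and an
$\mathfrak{O}$-monomorphism $U \to \mathrm{Hom}^{\dagger}(\mathfrak{O}, V)$, namely, the composite $U \to \mathrm{Hom}^{\dagger}(\mathfrak{O}, U)
\to \mathrm{Hom}^{\dagger}(\mathfrak{O}, V)$. It follows that there exists a primitive $\mathfrak{o}$-submodule $M$
of $H = \mathrm{Hom}^{\dagger}(\mathfrak{O}, N)$ such that $M\mathfrak{O} \subseteq M + \pi^{s+1} H$ and $M^{(s+1)} \cong U$. The
existence of $M^*$ now follows from Theorem 8.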


5. Projective modules in the complete case. We assume, as in the
preceding section, that the quotient field $k$ of $\mathfrak{o}$ is complete. Applying Theorems
3 and 7 we obtain


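THEOREM 9 (Brauer). Up to isomorphism and order of summands, every
$(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation module has a unique decomposition into a
direct sum of indecomposable $\mathfrak{O}$-modules. If $\mathfrak{O} = \sum \oplus\, \mathfrak{O}_\alpha$, where the $\mathfrak{O}_\alpha$ are
indecomposable right ideals, then $\mathfrak{O}/\pi\mathfrak{O} = \sum \oplus\, \mathfrak{O}_\alpha/\pi\mathfrak{O}_\alpha$ is a decomposition
of the $\mathfrak{o}/\pi\mathfrak{o}$-algebra $\mathfrak{O}/\pi\mathfrak{O}$ into indecomposable right ideals, and any indecomposable
$(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation module is isomorphic with one of the
$\mathfrak{O}_\alpha$.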

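Proof. Since $\mathfrak{O}/\pi\mathfrak{O}$ is a finite dimensional algebra over the field $\mathfrak{o}/\pi\mathfrak{o}$,
$\mathfrak{O}/\pi\mathfrak{O} = \sum \oplus\, \bar{\mathfrak{O}}_\alpha$ with the $\bar{\mathfrak{O}}_\alpha$ indecomposable right ideals unique up to
order and isomorphism. By Theorem 7, there exist right ideals $\mathfrak{O}_\alpha$ of $\mathfrak{O}$ such
that $\mathfrak{O} = \sum \oplus\, \mathfrak{O}_\alpha$ and $\mathfrak{O}_\alpha/\pi\mathfrak{O}_\alpha \cong \bar{\mathfrak{O}}_\alpha$. Since the $\bar{\mathfrak{O}}_\alpha$ are indecomposable,
Theorem 7 implies that the $\mathfrak{O}_\alpha$ are too.

If $M$ is an $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-module, so is $M/\pi M$ by Corollary 2 to
Theorem 5, and hence $M/\pi M$ is $(\mathfrak{O}/\pi\mathfrak{O}, \mathfrak{o}/\pi\mathfrak{o})$-projective. As is well known,
this implies that $M/\pi M = \sum \oplus\, \bar{M}_\beta$, where each $\bar{M}_\beta$ is isomorphic to some
$\bar{\mathfrak{O}}_\alpha$. Hence by Theorem 7, $M = \sum \oplus\, M_\beta$, where each $M_\beta$ is isomorphic to
some $\mathfrak{O}_\alpha$.

Suppose that $M = \sum \oplus\, N_\gamma$, with each $N_\gamma$ indecomposable. Then $N_\gamma/\pi N_\gamma$
is indecomposable by Theorem 7 and $M/\pi M = \sum \oplus\, N_\gamma/\pi N_\gamma$. Hence by the
Krull-Schmidt theorem, the $\bar{M}_\beta$ and $N_\gamma/\pi N_\gamma$ are equal in number and
isomorphic in pairs. Corollary 1 to Theorem 3 (with $s = 0$) implies therefore
that the $M_\beta$ and $N_\gamma$ are isomorphic in pairs.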

There is a similar result for $(\mathfrak{O}, \mathfrak{o})$-injective $\mathfrak{O}$-representation modules,
which may be proved similarly.

We now generalize Maranda's Theorem 4 of (7). It will be convenient
to begin with two lemmas, the second of which is a special case of our theorem.


LEMMA 7. If $\mathfrak{O}$ has depth 0, and the $\mathfrak{O}$-representation module $M$ has irreducible
rational hull, then $M/\pi M$ is an irreducible $\mathfrak{O}/\pi\mathfrak{O}$-module.


Proof. If $\mathfrak{O}$ has depth 0, every $\mathfrak{O}$-module is $(\mathfrak{O}, \mathfrak{o})$-projective, and by the
Corollary to Theorem 6, $\mathfrak{O}/\pi\mathfrak{O}$ is a separable $\mathfrak{o}/\pi\mathfrak{o}$-algebra. Hence $M/\pi M$ is
fully reducible, and Theorem 7 implies that it is irreducible.


LEMMA 8. If $\mathfrak{O}$ has depth 0, two $\mathfrak{O}$-representation modules $M$ and $N$ with
irreducible rational hulls are isomorphic if and only if their rational hulls are
isomorphic.


Proof. If $M$ and $N$ have isomorphic rational hulls, we may assume that
$M \subseteq N$. By Lemma 7, $N/\pi N$ is irreducible, hence $M/\pi M \cong (M + \pi N)/\pi N
= N/\pi N$. Corollary 1 to Theorem 3 implies therefore that $M \cong N$. The
converse is immediate.

We let $R$ denote the radical of the rational hull $A$ of $\mathfrak{O}$. Then $\mathfrak{O}/\mathfrak{R}$ is an
$\mathfrak{o}$-order with rational hull isomorphic to $A/R$, where $\mathfrak{R} = \mathfrak{O} \cap R$.


THEOREM 10. If $\mathfrak{O}/\mathfrak{R}$ has depth 0, then two $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation
modules are isomorphic if and only if their rational hulls are isomorphic.


Proof. Only the "if" part requires proof, and by Theorem 9 we have only
to consider indecomposable $\mathfrak{O}$-representation modules.










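Let $M$ be an indecomposable $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation module,
and let $V = M_k$. Then $V = V_1 \oplus \ldots \oplus V_t$, where the $V_i$ may be taken to
be indecomposable right ideal components of $A$ according to Theorem 1. Let

$$W_i = X_i + \sum_{j \neq i} V_j,$$

where $X_i$ is the unique maximal $A$-submodule of $V_i$. Then $V/W_i \cong V_i/X_i$,
the irreducible $A$-module determining the isomorphism class of $V_i$, which
we shall denote by $F_i$. We show now that $F_i \cong F$ ($i = 1, 2, \ldots, t$), $F = F_1$.

We note first that $\mathfrak{F}_i = M/(M \cap W_i)$ is an $\mathfrak{O}$-representation module with
rational hull isomorphic to $F_i$. Since $\mathfrak{F}_i$ can be considered as an $\mathfrak{O}/\mathfrak{R}$-module,
Lemma 7 implies that $\mathfrak{F}_i/\pi\mathfrak{F}_i$ is an irreducible $\mathfrak{O}/\pi\mathfrak{O}$-module. Theorem 7
implies that $M/\pi M$ is an indecomposable $\mathfrak{O}/\pi\mathfrak{O}$-module, hence, since $\mathfrak{F}_i/\pi\mathfrak{F}_i$
is isomorphic with a quotient module of $M/\pi M$, it follows that $\mathfrak{F}_i/\pi\mathfrak{F}_i$ is
uniquely determined by $M/\pi M$. That is, $\mathfrak{F}_i/\pi\mathfrak{F}_i \cong \mathfrak{F}/\pi\mathfrak{F}$ ($i = 1, 2, \ldots, t$),
where $\mathfrak{F} = \mathfrak{F}_1$. Hence $\mathfrak{F}_i \cong \mathfrak{F}$ by Corollary 1 to Theorem 3, which certainly
implies $F_i \cong F$.

Now let $N$ be a second indecomposable $(\mathfrak{O}, \mathfrak{o})$-projective $\mathfrak{O}$-representation
module such that $N_k \cong V$, and let $\mathfrak{G}$ correspond to $N$ as $\mathfrak{F}$ does to $M$. Then
we must have $\mathfrak{G}_k \cong F$, which implies $\mathfrak{G} \cong \mathfrak{F}$ by Lemma 8. Hence $\mathfrak{G}/\pi\mathfrak{G} \cong \mathfrak{F}/\pi\mathfrak{F}$,
which implies $N/\pi N \cong M/\pi M$. Hence $M \cong N$ by Corollary 1 to Theorem 3.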


An immediate consequence of the above proof is the following. 


COROLLARY. If $\mathfrak{O}/\mathfrak{R}$ has depth 0, then the rational hull of an indecomposable
$\mathfrak{O}$-representation module $M$ is isomorphic with a direct sum of isomorphic
indecomposable right ideal components of $A$.


It must be remarked that the completeness of $\mathfrak{o}$ is inessential for Theorem
10 and its Corollary, for, by Corollary 1 to Theorem 4, and the remark at
the end of § 3, the hypotheses survive transition from the local to the complete
case, and by Corollary 2 to Theorem 4, if the conclusion holds relative to
the completion of a local domain $\mathfrak{o}$, it holds relative to $\mathfrak{o}$.


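6. The Dedekind case. In this final section we assume that $\mathfrak{o}$ is a Dedekind
domain, and denote by $\mathfrak{o}_{\mathfrak{p}}$ the ring of quotients of $\mathfrak{o}$ with respect to the
complement of the prime ideal $\mathfrak{p}$ of $\mathfrak{o}$, that is, the ring of $\mathfrak{p}$-integers in the
quotient field $k$ of $\mathfrak{o}$. By $\mathfrak{O}_{\mathfrak{p}}$ we denote the $\mathfrak{o}_{\mathfrak{p}}$-hull of the $\mathfrak{o}$-order $\mathfrak{O}$, and by
$M_{\mathfrak{p}}$ the $\mathfrak{o}_{\mathfrak{p}}$-hull of an $\mathfrak{O}$-module $M$. An asterisk refers to the $\mathfrak{p}$-adic completion
as at the end of § 3.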

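According to Theorems 1 and 4, we have for any $\mathfrak{O}$-representation module
$M$ that

$$\mathfrak{o}_{\mathfrak{p}} \cdot D_{(\mathfrak{O},\mathfrak{o})}(M) = D_{(\mathfrak{O}_{\mathfrak{p}},\mathfrak{o}_{\mathfrak{p}})}(M_{\mathfrak{p}}) = D_{(\mathfrak{O}_{\mathfrak{p}}^*,\mathfrak{o}_{\mathfrak{p}}^*)}(M_{\mathfrak{p}}^*) \cap \mathfrak{o}_{\mathfrak{p}}.$$

Therefore, if we define $d_{\mathfrak{p}}(M)$ to be depth $M_{\mathfrak{p}}$ = depth $M_{\mathfrak{p}}^*$, we have

$$D(M) = \prod_{\mathfrak{p}} \mathfrak{p}^{d_{\mathfrak{p}}(M)}.$$

Similarly we have

$$C(M) = \prod_{\mathfrak{p}} \mathfrak{p}^{c_{\mathfrak{p}}(M)},$$

where $c_{\mathfrak{p}}(M)$ is defined to be codepth $M_{\mathfrak{p}}$ = codepth $M_{\mathfrak{p}}^*$. Some consequences
of these formulas are summarized in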


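THEOREM 11. An $\mathfrak{O}$-representation module $M$ is principal (coprincipal) if and
only if its $\mathfrak{o}_{\mathfrak{p}}$-hull $M_{\mathfrak{p}}$ is principal (coprincipal) for every prime $\mathfrak{p}$ of $\mathfrak{o}$. If $M$ is
principal (coprincipal), then $M_{\mathfrak{p}}$ is $(\mathfrak{O}_{\mathfrak{p}}, \mathfrak{o}_{\mathfrak{p}})$-projective (injective) for all but the
finitely many primes $\mathfrak{p}$ dividing $D(M)$ ($C(M)$), and $M$ is $(\mathfrak{O}, \mathfrak{o})$-projective
(injective) if and only if $M_{\mathfrak{p}}$ is $(\mathfrak{O}_{\mathfrak{p}}, \mathfrak{o}_{\mathfrak{p}})$-projective (injective) for all $\mathfrak{p}$.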


These results, together with Corollary 2 to Theorem 5, contain results of
Reiner (8).
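For the ring of rational integers the product formula for $D(M)$ is ordinary prime factorization. The sketch below is an illustration only: it takes the example of § 2, where the text states $D(M_n) = nZ$, and reads off the local depth at each prime $p$ as the exponent of $p$ in $n$. The function names are hypothetical, and the identification of the local depth with this exponent is the formula above specialised to $Z$, not an independent computation of any Ext group.

```python
def valuation(n, p):
    """Exponent of the prime p in the nonzero integer n."""
    n = abs(n)
    exponent = 0
    while n % p == 0:
        n //= p
        exponent += 1
    return exponent


def prime_factors(n):
    """Distinct primes dividing the nonzero integer n, by trial division."""
    n = abs(n)
    primes = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes


def local_depths(n):
    """Map each prime p dividing n to the depth at p when D(M) = nZ."""
    return {p: valuation(n, p) for p in prime_factors(n)}


def ideal_generator(depths):
    """Recover the positive generator of D(M) from its local depths."""
    generator = 1
    for p, d in depths.items():
        generator *= p ** d
    return generator
```

Primes absent from the dictionary have depth 0, which matches the statement of Theorem 11 that the local hull is projective at all primes not dividing $D(M)$.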


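We obtain in the same way that $I(\mathfrak{O}) = \prod_{\mathfrak{p}} \mathfrak{p}^{d_{\mathfrak{p}}(\mathfrak{O})}$, where $d_{\mathfrak{p}}(\mathfrak{O})$ is defined
to be depth $\mathfrak{O}_{\mathfrak{p}}$ = depth $\mathfrak{O}_{\mathfrak{p}}^*$. From this, Theorem 2, and Corollary 2 to
Theorem 2 we conclude that


THEOREM 12. If $\mathfrak{O}$ has separable rational hull, $\mathfrak{O}/\mathfrak{p}\mathfrak{O}$ is a separable $\mathfrak{o}/\mathfrak{p}$-algebra
for all but the finitely many primes $\mathfrak{p}$ dividing $I(\mathfrak{O})$.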


Finally we apply our results to the question of class numbers. Let $S$ be a
complete set of non-isomorphic, rationally equivalent $\mathfrak{O}$-representation
modules. Then there exists an $A$-representation module $U$, $A = \mathfrak{O}_k$, such
that $M_k \cong U$ for all $M$ in $S$ (and we may in fact assume that $M$ is an
$\mathfrak{O}$-submodule of $U$ for every $M \in S$). The cardinal $r = r(U)$ of the set $S$ is
called the class number of $U$ (with respect to $\mathfrak{O}$).

As was pointed out in (2), Maranda's method (7) can be extended to
prove that if $A$ is separable, every absolutely irreducible $A$-representation
module has finite class number, the ideal $I(\mathfrak{O}) \neq 0$ playing the role played
by the group order in his arguments. On the other hand, the result of (4)
shows the existence of non-separable semi-simple $A$ for which every
$A$-representation module has finite class number.

In the example of § 2, $\mathfrak{o} = Z$, the ring of rational integers, and the $M_n$
($n = 1, 2, \ldots$) are readily seen to constitute a complete set of non-isomorphic
$\mathfrak{O}$-representation modules rationally equivalent to a fixed indecomposable
right ideal component $U$ of $A$. Hence $U$ has infinite class number. But the fact
that $D(M_n) = nZ$ ($n = 1, 2, \ldots$) suggests the following definition for the
general case: The class number $r(M)$ of a principal $\mathfrak{O}$-module $M$ is defined
to be the cardinal of the set $S_M$, where $S_M$ is a complete set of non-isomorphic
$\mathfrak{O}$-representation modules $N$ rationally equivalent to $M$ and such that
$D(N) = D(M)$. We cannot prove here that $r(M)$ is finite when $\mathfrak{o}$ has finite
ideal class number, but we reduce the problem somewhat, along the lines
of the first main result of (7).













Note that for an $A$-representation module $U$,

$$r(U) = \sum r(M),$$

where the sum is over a set of $M \in S$ such that $D(M)$ takes on exactly once
each possible value. In the case of separable $A$, $I(\mathfrak{O}) \neq 0$, and hence, since
$D(M)$ divides $I(\mathfrak{O})$ for every $M$, the number of summands is finite. That the
sum need not be finite in the general case is shown by the example mentioned
in the preceding paragraph.

We shall say that two $\mathfrak{O}$-representation modules $M$ and $N$ are equivalent
at a prime $\mathfrak{p}$ of $\mathfrak{o}$ if their $\mathfrak{o}_{\mathfrak{p}}$-hulls are isomorphic, or what is the same thing
by Corollary 2 to Theorem 4, if their $\mathfrak{o}_{\mathfrak{p}}^*$-hulls are isomorphic. We shall say
that $M$ and $N$ belong to the same genus if they are equivalent at all primes
$\mathfrak{p}$ of $\mathfrak{o}$. If $M$ is a principal $\mathfrak{O}$-module, we let

$r_{\mathfrak{p}}(M)$ = the number of classes in $S_M$ under equivalence at $\mathfrak{p}$,

and

$g(M)$ = the number of genera in $S_M$.

From the definitions, we have $g(M) \le \prod_{\mathfrak{p}} r_{\mathfrak{p}}(M)$. By Theorems 10 and 11 we
have that $r_{\mathfrak{p}}(M) = 1$ when $\mathfrak{p} \nmid D(M) \cap I(\mathfrak{O}/\mathfrak{R})$, where $\mathfrak{R} = \mathfrak{O} \cap R$, $R$
being the radical of $A$. Corollary 2 to Theorem 3 implies that $r_{\mathfrak{p}}(M)$ is finite
when $\mathfrak{o}$ has finite residue class rings. Hence


THEOREM 13. Assume that $A/R$ is separable. Then for a principal $\mathfrak{O}$-module
$M$, $g(M) \le \prod r_{\mathfrak{p}}(M)$, the product extending over the primes $\mathfrak{p}$ dividing
$D(M) \cap I(\mathfrak{O}/\mathfrak{R})$, and $g(M)$ is finite if $\mathfrak{o}$ has finite residue class rings.


For an $A$-module $U$, let $g(U)$ denote the number of genera of $\mathfrak{O}$-representation
modules with rational hull isomorphic to $U$, and for a prime ideal $\mathfrak{p}$ of
$\mathfrak{o}$, let $r_{\mathfrak{p}}(U)$ denote the number of classes under equivalence at $\mathfrak{p}$ of such
$\mathfrak{O}$-representation modules. As was pointed out in (2), if $A$ is separable and
$U$ is absolutely irreducible, Maranda's methods (7) can be extended to
prove that

(1) $\quad g(U) = \prod_{\mathfrak{p}} r_{\mathfrak{p}}(U)$,

where the product extends over all prime ideals $\mathfrak{p}$ of $\mathfrak{o}$, and

(2) $\quad r(U) = h \cdot g(U)$,

where $h$ is the ideal class number of $\mathfrak{o}$. We leave open the general questions
of when equality holds in the formula of Theorem 13, and when $r(M) = h \cdot g(M)$
for an $\mathfrak{O}$-representation module $M$. (See also (9) in this regard.)

Taking $M$ to be a coprincipal $\mathfrak{O}$-module instead of principal, and replacing
$D(M)$ by $C(M)$ in the above definitions, we obtain numbers $s(M)$, $s_{\mathfrak{p}}(M)$,
and $h(M)$, between which relations hold analogous to those for $r(M)$, $r_{\mathfrak{p}}(M)$,
and $g(M)$.




There are also some relations between left and right which should be
mentioned. If $M$ is an $\mathfrak{O}$-representation module, $M^+ = \mathrm{Hom}^{\dagger}(M, \mathfrak{o})$ is a
left $\mathfrak{O}$-representation module, and since $\mathfrak{o}$ is a Dedekind domain, $\mathrm{Hom}^{\dagger}(M^+, \mathfrak{o})
\cong M$ and $\mathrm{Hom}^{\ddagger}(M, N) \cong \mathrm{Hom}^{\ddagger}(N^+, M^+)$ for $\mathfrak{O}$-representation modules
$M$ and $N$. Now it can be verified that $\mathrm{Ext}^1(M, N) \cong \mathrm{Ext}^1(N^+, M^+)$, and hence
that $D(M) = C(M^+)$ and $C(M) = D(M^+)$. Moreover, relations such as
$r(M) = s(M^+)$ and $g(M) = h(M^+)$ hold.


REFERENCES 


1. H. Cartan and S. Eilenberg, Homological algebra (Princeton, 1956).

2. D. G. Higman, On orders in separable algebras, Can. J. Math., 7 (1955), 509-515.

3. D. G. Higman, Relative cohomology, Can. J. Math., 9 (1957), 19-34.

4. D. G. Higman and J. E. MacLaughlin, Finiteness of class numbers of representations of
algebras over function fields, to appear in the Michigan Journal of Mathematics.

5. G. Hochschild, Relative homological algebra, Trans. Amer. Math. Soc., 82 (1956), 246-269.

6. J-M. Maranda, On p-adic integral representations of finite groups, Can. J. Math., 5 (1953),
344-355.

7. J-M. Maranda, On the equivalence of representations of finite groups by groups of automorphisms of
modules over Dedekind rings, Can. J. Math., 7 (1955), 516-526.

8. I. Reiner, Maschke modules over Dedekind rings, Can. J. Math., 8 (1956), 329-334.

9. On class numbers of representations of an order, A. M. S. Notices, 5 (1958), Abstract
no. 548-142, 584.


University of Michigan








INTEGRAL p-adic NORMAL MATRICES SATISFYING 
THE INCIDENCE EQUATION 


J. K. GOLDHABER 


1. Introduction. The problem of arranging v elements into v sets in such 
a way that every set contains exactly k distinct elements and that every pair 
of sets has exactly λ = k(k − 1)/(v − 1) elements in common, where 0 < λ < 
k < v, is equivalent to finding a normal integral v by v matrix A such that 
A^T A = B, where B is the v by v matrix having k in every position on the 
main diagonal and λ in all other positions (10). Utilizing the fact that for 
the existence of a λ, k, v design it is necessary that I (the v by v identity matrix) 
represent B rationally, (2) and (3) have proved the non-existence of certain 
λ, k, v designs. Neither of the proofs utilizes the fact that it is necessary that A 
be normal. However, Albert (1) for the projective plane case and Hall and 
Ryser (5) for the general design proved that if there exists a rational A such 
that A^T A = B then there exists a normal rational matrix satisfying the same 
equation. Thus the requirement of normality does not exclude any λ, k, v 
which were not previously excluded. 
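
The equivalence stated above can be checked on the smallest non-trivial case. The following Python sketch is an illustration added here and is not part of the original argument: it assumes the (v, k, λ) = (7, 3, 1) design obtained from the difference set {0, 1, 3} modulo 7, and all helper names are hypothetical. It verifies that the incidence matrix A of this design satisfies A^T A = AA^T = B.

```python
def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    """Plain integer matrix product of two lists of rows."""
    yt = transpose(y)
    return [[sum(a * b for a, b in zip(row, col)) for col in yt] for row in x]


def incidence_matrix(blocks, v):
    """Row i is the 0/1 indicator vector of block i over the points 0..v-1."""
    return [[1 if p in blk else 0 for p in range(v)] for blk in blocks]


def design_matrix_b(v, k, lam):
    """The matrix B: k on the main diagonal, lam in all other positions."""
    return [[k if i == j else lam for j in range(v)] for i in range(v)]


# Assumed example: translates of the difference set {0, 1, 3} modulo 7.
V, K, LAM = 7, 3, 1
BLOCKS = [{(d + s) % V for d in (0, 1, 3)} for s in range(V)]

A = incidence_matrix(BLOCKS, V)
B = design_matrix_b(V, K, LAM)
ATA = matmul(transpose(A), A)
AAT = matmul(A, transpose(A))

print("A^T A == B:", ATA == B)
print("A A^T == B:", AAT == B)
```

Any other symmetric design could be substituted for `BLOCKS`; only the parameters V, K, LAM would need to change with it.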

It is evident that for the existence of a λ, k, v design it is necessary that for 
every prime p there exist an integral p-adic normal matrix A satisfying A^T A = 
B. Assuming that (k, k − λ) = 1, we prove in § 2 that if I represents B 
rationally then this necessary condition is satisfied. Thus, once again, no 
additional designs are excluded. It does follow, however, that if I represents 
B rationally then I represents B without essential denominator and, 
furthermore, that there is a form in the genus of I which represents B integrally. 

In § 3 we consider a modified incidence equation which is satisfied by every 
incidence matrix and which, if J represents B rationally, has integral solutions. 
Sufficient conditions for the existence of a λ, k, v design in terms of these 
integral solutions are given. 


2. The incidence equation examined locally. We assume throughout 
this paper that (k, k − λ) = 1. Thus, since λv = k² − (k − λ) we have 
(λ, k) = (λ, k − λ) = (v, k) = (v, k − λ) = 1. The matrices I and B are as 
above. We prove 
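
As a quick sanity check of the coprimality relations just stated, the following Python sketch (an added illustration; the parameter triples are merely sample values satisfying λ(v − 1) = k(k − 1), and the function name is hypothetical) confirms the identity λv = k² − (k − λ) and evaluates the four greatest common divisors for triples with (k, k − λ) = 1.

```python
from math import gcd


def check_parameters(v, k, lam):
    """Return (identity_holds, gcds) for a triple with lam*(v-1) == k*(k-1)."""
    if lam * (v - 1) != k * (k - 1):
        raise ValueError("parameters do not satisfy lam*(v-1) = k*(k-1)")
    identity_holds = lam * v == k * k - (k - lam)
    gcds = (gcd(lam, k), gcd(lam, k - lam), gcd(v, k), gcd(v, k - lam))
    return identity_holds, gcds


SAMPLES = [(7, 3, 1), (11, 5, 2), (13, 4, 1), (15, 7, 3), (16, 6, 2)]

for v, k, lam in SAMPLES:
    if gcd(k, k - lam) != 1:
        # The standing assumption (k, k - lam) = 1 fails, e.g. for (16, 6, 2).
        print((v, k, lam), "skipped: (k, k - lam) != 1")
        continue
    print((v, k, lam), check_parameters(v, k, lam))
```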


THEOREM 1. If I represents B rationally then for every prime p there exists a 
matrix A with elements in the ring R(p) of p-adic integers such that A^T A = AA^T 
= B. 


Received November 3, 1958. Research supported in part by the Office of Ordnance Research 
under Contract DA-23-072-ORD-1051. 




We show first that there exists a matrix C (not necessarily normal) with 
elements in R(p) such that C^T C = B. It follows from well-known theorems on 
quadratic forms (7) and the fact that I and B are both positive definite that 
it is sufficient to show this for all primes p ∈ P, where P is the set of all 
prime divisors of 2·det B = 2k²(k − λ)^{v−1}. Let T be the v by v matrix 








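\[
T = \begin{pmatrix}
1 & -1 & -1 & \cdots & -1 \\
1 & 1 & 0 & \cdots & 0 \\
1 & 0 & 1 & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
1 & 0 & 0 & \cdots & 1
\end{pmatrix}.
\]

Then

\[
T^T T = \begin{pmatrix} v & 0 \\ 0 & I_1 + S_1 \end{pmatrix},
\]

where I₁ is the (v − 1) by (v − 1) identity matrix and S₁ is the (v − 1) by 
(v − 1) matrix each of whose entries is 1. Also, 

\[
T^T B T = \begin{pmatrix} k^2 v & 0 \\ 0 & (k - \lambda)(I_1 + S_1) \end{pmatrix}.
\]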


Since (k, k − λ) = 1, v is a p-adic unit for all odd p ∈ P. v is also a 2-adic 
unit in the case that v is odd. Hence, for odd p, X^T X = B is solvable in 
R(p), p ∈ P, if and only if X^T(T^T T)X = T^T B T is solvable in R(p); and for 
odd v, X^T X = B is solvable in R(2) if and only if X^T(T^T T)X = T^T B T is 
solvable in R(2). 
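
The two block-diagonal identities for T used in this reduction can be confirmed numerically. The Python sketch below is an added illustration under the assumption that T has first row (1, −1, ..., −1), first column all ones, and the identity in the remaining block; the function names are hypothetical.

```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    yt = transpose(y)
    return [[sum(a * b for a, b in zip(row, col)) for col in yt] for row in x]


def build_t(v):
    """First column all ones, first row (1, -1, ..., -1), identity elsewhere."""
    t = [[0] * v for _ in range(v)]
    for i in range(v):
        t[i][0] = 1
    for j in range(1, v):
        t[0][j] = -1
        t[j][j] = 1
    return t


def design_matrix_b(v, k, lam):
    return [[k if i == j else lam for j in range(v)] for i in range(v)]


def block_diagonal(corner, scale, v):
    """The matrix [[corner, 0], [0, scale*(I_1 + S_1)]] of size v by v."""
    m = [[0] * v for _ in range(v)]
    m[0][0] = corner
    for i in range(1, v):
        for j in range(1, v):
            m[i][j] = scale * (2 if i == j else 1)
    return m


def check_t_identities(v, k, lam):
    """True if T^T T and T^T B T have the stated block-diagonal forms."""
    t = build_t(v)
    tt = transpose(t)
    b = design_matrix_b(v, k, lam)
    gram_ok = matmul(tt, t) == block_diagonal(v, 1, v)
    b_ok = matmul(matmul(tt, b), t) == block_diagonal(k * k * v, k - lam, v)
    return gram_ok and b_ok


print(check_t_identities(7, 3, 1), check_t_identities(11, 5, 2))
```

The second identity relies on λv = k² − (k − λ), so the check is only meaningful for parameter triples satisfying λ(v − 1) = k(k − 1).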

We first dispose of the case when v is even. Since I represents B rationally, 
(k − λ) is a square (3); whence, obviously T^T T represents T^T B T in R(p) for 
all odd p ∈ P. Furthermore, since v is even and (k, k − λ) = 1 it follows that 
k and k − λ are both odd. Thus I and B are properly primitive forms (that 
is, each has a 2-adic unit element on its main diagonal) with unit 2-adic 
determinants which, since they are rationally congruent, are congruent over 
the 2-adic field. Hence (7, Theorem 36) they are equivalent in R(2). Thus, 
if v is even I represents B in R(p) for all p ∈ P. 

Suppose now that v is odd. It is clearly sufficient to show that I₁ + S₁ 
represents (k − λ)(I₁ + S₁) in R(p) for all p ∈ P. 

(i) I₁ + S₁ represents (k − λ)(I₁ + S₁) in R(2). 

(a) Suppose k − λ = 2^{2b} m where b ≥ 0 and m is odd. We make use here, 
and below, of the following known theorem (6): 

Two improperly primitive forms (that is, each form has some 2-adic unit 
element but no 2-adic unit element on its main diagonal) in the same number 
of variables and of odd determinants are equivalent in R(2) if and only if 
their determinants are congruent mod 8. 

From this it follows that I₁ + S₁ and m(I₁ + S₁) are equivalent in R(2). 
But then, obviously, I₁ + S₁ represents 2^{2b} m(I₁ + S₁) in R(2). 

(b) Suppose k − λ = 2^{2b+1} m where m is odd. We shall show below that in 
this case the assumption that I represents B rationally implies that v ≡ ± 1 
mod 8. 

If v ≡ 1 mod 8 then I₁ + S₁ and m(I₁ + S₁) are both equivalent to the 
½(v − 1) fold direct sum of the matrix 

\[
\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.
\]

Call the direct sum matrix K. It is thus sufficient to show that K represents 
2^{2b+1} K. Since v ≡ 1 mod 8, 4 | v − 1. Let L be the ¼(v − 1) fold direct sum of 


0 1 0 −1 
1 0 −1 0 


Then (2^b L)^T K (2^b L) = 2^{2b+1} K as desired. 
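
To make the representation of 2^{2b+1} K by K concrete, the following Python sketch exhibits one integral 4 by 4 block with L^T K L = 2K, where K is here the two-fold direct sum of the 2 by 2 matrix above. This is an added illustration, not a transcription of the author's matrix: the first two rows follow the rows legible in this copy, while the last two rows are an assumption chosen so that the identity holds.

```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    yt = transpose(y)
    return [[sum(a * b for a, b in zip(row, col)) for col in yt] for row in x]


def scaled(m, c):
    """Multiply every entry of m by the integer c."""
    return [[c * entry for entry in row] for row in m]


def direct_sum(a, b):
    """Block-diagonal matrix with blocks a and b."""
    n, m = len(a), len(b)
    top = [row + [0] * m for row in a]
    bottom = [[0] * n + row for row in b]
    return top + bottom


H = [[0, 1], [1, 0]]
K = direct_sum(H, H)

# Rows 1-2 as legible in the text; rows 3-4 are an assumed completion.
L = [
    [0, 1, 0, -1],
    [1, 0, -1, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
]


def represents_scaled(b):
    """Check (2^b L)^T K (2^b L) == 2^(2b+1) K."""
    lb = scaled(L, 2 ** b)
    return matmul(matmul(transpose(lb), K), lb) == scaled(K, 2 ** (2 * b + 1))


print([represents_scaled(b) for b in range(4)])
```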
If v ≡ −1 mod 8 then I₁ + S₁ and m(I₁ + S₁) are both equivalent in 
R(2) to K₁ ⊕ K₂. Here ⊕ denotes direct sum, K₁ is the ½(v − 7) fold direct 
sum of 

\[
\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix},
\]

and K₂ is the 6 by 6 matrix having each entry on its main diagonal equal to 2 
and all other entries equal to 1. Note that 4 | v − 7. Let L₁ be the ¼(v − 7) 
fold direct sum of the 4 by 4 matrix given in the preceding paragraph, and 
let L₂ be the matrix 


—|} 0-1 -l - 
1 0 0 0 
-1 -l]1 -!] 0 

0 0 ] 1 

] | 0 

] 0 l 


oroor = 


1 








—mmw mw © 
~ os | 


Then [2^b(L₁ ⊕ L₂)]^T (K₁ ⊕ K₂)[2^b(L₁ ⊕ L₂)] = 2^{2b+1}(K₁ ⊕ K₂); whence, I₁ + S₁ 
represents (k − λ)(I₁ + S₁) in R(2) as desired. 

It remains to show that if I represents B rationally and if k − λ = 2^{2b+1} m, 
m odd, then v ≡ ± 1 mod 8. Since λv = k² − (k − λ) we have 


(29). (2) 1 







where (a/b) is the Jacobi symbol. (Since (k, k − λ) = 1, λ is odd.) We thus have 


(") 0 (3) (-1F 


We consider the cases b ≥ 1, b = 0 separately. 

If b ≥ 1 then λ ≡ v mod 8. If v ≡ 3 mod 8 then (−λ/m) = −1 but this is 
impossible since I represents B rationally (3). The case v ≡ 5 mod 8 is disposed 
of similarly. 

If b = 0 then λv ≡ −1 or 3 mod 8 according as k − λ ≡ 2 or 6 mod 8. 
If k − λ ≡ 2 mod 8 and v ≡ 3 mod 8 then (−λ/m) = −1 which is impossible. 
Similar easy computations exclude all possibilities other than v ≡ ± 1 mod 8. 


(ii) I₁ + S₁ represents (k − λ)(I₁ + S₁) in R(p) for all odd p such that p | k. 

We make use here, and below, of the following known theorem (6, 11). 

For odd p, two forms, f and g, in the same number of variables and of unit 
determinants in R(p) are equivalent in R(p) if and only if 

\[
\left(\frac{\det f}{p}\right) = \left(\frac{\det g}{p}\right).
\]

The desired result is an immediate consequence of this theorem. We actually 
have somewhat more; namely, I₁ + S₁ represents (k − λ)(I₁ + S₁) in R(p) 
for all odd p such that p ∤ (k − λ)v. 


(iii) I₁ + S₁ represents (k − λ)(I₁ + S₁) in R(p) for all odd p such that 
p | (k − λ). Suppose k − λ = p^b m where (p, m) = 1 and b > 0. We consider 
two cases: (a) v ≡ 1 mod 4, and (b) v ≡ 3 mod 4. 

(a) Since I represents B rationally we must have (v/p) = (λ/p) = 1, (2). 
Thus det (I₁ + S₁) = v and det [m(I₁ + S₁)] = m^{v−1} v are both units and 
perfect squares in R(p). Therefore, I₁ + S₁ and m(I₁ + S₁) are both 
equivalent in R(p) to I₁. It is thus sufficient to show that I₁ represents p^b I₁ in R(p). 
If b is even, this is obvious. Suppose then that b = 2c + 1. We use the device 
employed in (3). There exist integers a_i, i = 1, 2, 3, 4, such that 
a₁² + a₂² + a₃² + a₄² = p. Let L be the ¼(v − 1) fold direct sum of 

\[
p^{c} \begin{pmatrix}
a_1 & a_2 & a_3 & a_4 \\
a_2 & -a_1 & -a_4 & a_3 \\
a_3 & a_4 & -a_1 & -a_2 \\
a_4 & -a_3 & a_2 & -a_1
\end{pmatrix}.
\]

Then L^T L = p^{2c+1} I₁ as desired. 
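
The four-square device can be checked directly. The Python sketch below is an added illustration with hypothetical function names: it finds a₁, ..., a₄ by brute force (rather than by any method used in (3)), builds the 4 by 4 block with the sign pattern a₁ a₂ a₃ a₄ / a₂ −a₁ −a₄ a₃ / a₃ a₄ −a₁ −a₂ / a₄ −a₃ a₂ −a₁, and confirms that p^c times this block has Gram matrix p^{2c+1} times the identity.

```python
from math import isqrt


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    yt = transpose(y)
    return [[sum(a * b for a, b in zip(row, col)) for col in yt] for row in x]


def four_squares(p):
    """Brute-force search for non-negative a1..a4 with a1^2+a2^2+a3^2+a4^2 = p."""
    bound = isqrt(p)
    for a1 in range(bound + 1):
        for a2 in range(bound + 1):
            for a3 in range(bound + 1):
                rest = p - a1 * a1 - a2 * a2 - a3 * a3
                if rest < 0:
                    continue
                a4 = isqrt(rest)
                if a4 * a4 == rest:
                    return a1, a2, a3, a4
    raise ValueError("no representation found")


def four_square_block(p, c=0):
    """p^c times the 4 by 4 block whose rows are mutually orthogonal of norm p."""
    a1, a2, a3, a4 = four_squares(p)
    block = [
        [a1, a2, a3, a4],
        [a2, -a1, -a4, a3],
        [a3, a4, -a1, -a2],
        [a4, -a3, a2, -a1],
    ]
    return [[p ** c * entry for entry in row] for row in block]


def gram_is_scalar(p, c):
    """Check L^T L == p^(2c+1) I_4 for the block L = four_square_block(p, c)."""
    block = four_square_block(p, c)
    expected = [[p ** (2 * c + 1) if i == j else 0 for j in range(4)] for i in range(4)]
    return matmul(transpose(block), block) == expected


print([gram_is_scalar(p, c) for p in (3, 5, 7, 13) for c in (0, 1)])
```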
(b) For v = 3 mod 4 we must have 










(3). Thus J, +5, and m(J, + 5S;) are both equivalent to the (v — 1) 
(v — 1) diagonal matrix [1, 1,...,1, (—1)*"*] = J. We must show that 
J represents p^b J. For even b this is obvious and so we may take b = 2c + 1. 
If p ≡ 3 mod 4 then let L₁ be the ¼(v − 3) fold direct sum of the 4 by 4 matrix 
given in the previous paragraph and let 


-|4 ] 
=~? } ‘| 
where a is a p-adic integer such that a? = 1 + p. Then ((L; @L,)|? J[(Li @ 


L.)| = p’J. lf p = 1 mod 4 then there exist integers a; and az such that a;? + 
a? = p. Let L be the }(v — 1) fold direct sum of 


P a ade 
p a2 — : 
Then L7JL = p’J. 


It follows from all the above that for every p there exists a matrix C with 
elements in R(p) such that C^T C = B. It remains to show that there exists a 
normal matrix with the desired properties. If p ∤ v this is clear. In fact, we 
have seen above that for every p ∤ v there exists a C₁ with elements in R(p) 
such that C₁^T (I₁ + S₁) C₁ = (k − λ)(I₁ + S₁). Let A = T(k ⊕ C₁) T⁻¹. 
Then A^T A = B and AS = kS, where S is the v by v matrix composed entirely 
of ones; whence by (5, Theorem 3.1) A is normal. Since p ∤ v, A has its 
elements in R(p). 

Suppose p | v. We know that there exists a matrix C = (c_ij) with elements in 
R(p) such that C^T C = B. Let α be the column vector [r₁, r₂, ..., r_v] where 
r_i = Σ_j c_ij, and let β be the v by 1 column vector each of whose entries is k. 
We will show that there exists an orthogonal matrix O with elements in R(p) 
such that Oα = β. It will follow that A = OC is such that A^T A = B, AS = kS; 
and again by (5), A is normal as desired. 

We use the following theorem (4, Satz 10.4). It is stated here more 
concretely and in a less general form than in (4). 

Let V be a v dimensional vector space of column vectors over the p-adic 
field with a non-degenerate ground form given by the v by v symmetric matrix 
G. Let F be a lattice in V and let D be its different. If α and β are primitive 
vectors in F such that α^T Gα = β^T Gβ and α − β ∈ D then there exists a v 
by v matrix O with elements in R(p) such that O^T GO = G and Oα = β. 

For our purposes we take G to be the identity matrix, F as the lattice 
which has as a basis the column vectors of the identity matrix I, and α and β 
as above. We note that if p is odd then D = F and if p = 2 then D is the 
lattice which has as a basis the column vectors of 2I. From the fact that 
C^T C = B it follows easily (10) that Σ_i c_ij r_i = k² and Σ_i r_i² = k²v. From the 
first of the latter equations and the facts that p | v, (k, v) = 1 it follows that α 
and β are both primitive. From the second of these equations it follows that 
α^T α = β^T β. Hence if p is odd the desired O exists. 




In order to complete the proof for p = 2 it is sufficient to show that r_i ≡ 1 
mod 2. Let t_ij be the inner product of the ith and jth rows of C. Again as in 
(10) we have 

\[
k^2 t_{ij} = \lambda r_i r_j + k^2 (k - \lambda)\,\delta_{ij}.
\]

If r_i ≡ 0 mod 2 then t_ii ≡ 1 mod 2 and we would have 

\[
0 \equiv r_i \equiv \sum_j c_{ij}^2 = t_{ii} \equiv 1 \pmod 2,
\]

which is clearly absurd. 
This completes the proof of Theorem 1. 


As immediate consequences we have 


COROLLARY 1. If I represents B over the rational field then I represents B 
rationally without essential denominator, that is, for every positive integer m 
there is a matrix D with rational elements whose denominators are prime to 
m such that D^T D = B. 


COROLLARY 2. If I represents B rationally then there exists a form in the 
genus of I which represents B integrally. 


3. A modified incidence equation. Since the genus of the identity 
contains more than one class for v > 8 (8), Corollary 2 does not yield any new 
designs. It is natural, therefore, to examine a matric equation, akin to X^T X 
= B, which is still satisfied by every incidence matrix, has integral solutions, 
and then to examine the relationship of these integral solutions to incidence 
matrices. 

THEOREM 2. Let t = a/b be a rational number greater than 1/v such that 
(av − b)b is odd. Let S be the v by v matrix composed entirely of ones. If I 
represents B rationally then I − tS represents B − tk²S integrally. 


For by Theorem 1, bI − aS represents bB − ak²S in every R(p). (The 
normality of A implies that SA = AS = kS and therefore A^T(bI − aS)A = 
bB − ak²S.) Hence there exists a form in the genus of bI − aS which represents 
bB − ak²S integrally. Since the genus of an indefinite form of odd determinant 
in v > 2 variables consists of exactly one class (9), Theorem 2 follows. 
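
The remark that every incidence matrix satisfies the modified equation can be illustrated with exact rational arithmetic. The Python sketch below is an added example, assuming the (7, 3, 1) design from the difference set {0, 1, 3} modulo 7 and hypothetical helper names; it checks A^T(I − tS)A = B − tk²S for sample values of t.

```python
from fractions import Fraction


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(x, y):
    yt = transpose(y)
    return [[sum(a * b for a, b in zip(row, col)) for col in yt] for row in x]


V, K, LAM = 7, 3, 1
# Assumed example design: translates of {0, 1, 3} modulo 7.
BLOCKS = [{(d + s) % V for d in (0, 1, 3)} for s in range(V)]
A = [[1 if p in blk else 0 for p in range(V)] for blk in BLOCKS]


def modified_lhs(a, t):
    """A^T (I - tS) A, with S the all-ones matrix."""
    n = len(a)
    middle = [[(1 if i == j else 0) - t for j in range(n)] for i in range(n)]
    return matmul(matmul(transpose(a), middle), a)


def modified_rhs(v, k, lam, t):
    """B - t k^2 S, with B having k on the diagonal and lam elsewhere."""
    return [[(k if i == j else lam) - t * k * k for j in range(v)] for i in range(v)]


def satisfies_modified_equation(t):
    t = Fraction(t)
    return modified_lhs(A, t) == modified_rhs(V, K, LAM, t)


# t = 2 and t = 2/3 both exceed 1/v and make (a*v - b)*b odd.
print(satisfies_modified_equation(2), satisfies_modified_equation(Fraction(2, 3)))
```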

Let 𝒮 be the set of all rationals which have the properties stated in Theorem 
2. For t ∈ 𝒮 let A(t) = (a_ij(t)) denote an arbitrary but fixed integral solution 
of X^T(I − tS)X = B − tk²S. Let r_i(t) = Σ_j a_ij(t), and s_j(t) = Σ_i a_ij(t). 


THEOREM 3. (i) If A(t₀) is normal and t₀ ≠ (k ± (k − λ)^{1/2})/kv then A(t₀) is 
an incidence matrix or the negative of one. 

(ii) If for t₁, t₂ ∈ 𝒮, t₁ ≠ t₂, we have r_i(t₁) = r_i(t₂) (s_i(t₁) = s_i(t₂)) for 
i = 1, 2, 3, ..., v, then A(t₁) is an incidence matrix or the negative of one. 

(iii) If there exists an M and a subset 𝒮′ of 𝒮 containing sufficiently many 
distinct elements (see below) such that |r_i(t)| < M (|s_i(t)| < M) for t ∈ 𝒮′ and 
i = 1, 2, ..., v then there exists a t₀ ∈ 𝒮′ such that A(t₀) is an incidence matrix 
or the negative of one. 

(iv) If r_i(t₀) > 0 for i = 1, 2, ..., v and for t₀ > 1 then A(t₀) is an incidence 
matrix. 

(i) As in (10) the following relations may be established: For every t ∈ 𝒮, 

\[
\sum_i r_i^2(t) - t\Bigl(\sum_i r_i(t)\Bigr)^2 = k^2 v(1 - tv),
\]
\[
k^2(1 - tv)\Bigl(\sum_i s_i^2(t)\Bigr) + (k^2 t - \lambda)\Bigl(\sum_i r_i(t)\Bigr)^2 = k^2(k - \lambda)v.
\]

Now the normality of A(t₀) implies that Σ r_i²(t₀) = Σ s_i²(t₀). Since t₀ ≠ (k ± 
(k − λ)^{1/2})/kv, the above equations imply that Σ r_i²(t₀) = k²v and Σ s_i(t₀) = 
Σ r_i(t₀) = ± kv. Whence r_i(t₀) = s_i(t₀) = k or r_i(t₀) = s_i(t₀) = −k for all i. 
But then A^T A = AA^T = B and the result follows by (10, Theorem 2.1). 
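
The elimination step in (i) can be made explicit. Writing Q for the common value of Σ r_i² and Σ s_i², and P for (Σ r_i)², the two relations form a linear system in Q and P whose determinant vanishes exactly when k²v t² − 2k²t + λ = 0, that is, when t = (k ± (k − λ)^{1/2})/kv. The Python sketch below is an added illustration with hypothetical names; it solves the system with exact fractions and returns P = k²v², Q = k²v whenever the determinant is non-zero.

```python
from fractions import Fraction


def solve_sums(v, k, lam, t):
    """Solve  Q - t*P = k^2 v (1 - t v)  and
              k^2 (1 - t v) Q + (k^2 t - lam) P = k^2 (k - lam) v
    for (P, Q); return None when the system is singular."""
    t = Fraction(t)
    c = k * k * (1 - t * v)          # coefficient of Q in the second relation
    d = k * k * t - lam              # coefficient of P in the second relation
    det = d + t * c                  # determinant of [[1, -t], [c, d]]
    if det == 0:
        return None
    rhs1 = k * k * v * (1 - t * v)
    rhs2 = k * k * (k - lam) * v
    q = (rhs1 * d + t * rhs2) / det  # Cramer's rule
    p = (rhs2 - c * rhs1) / det
    return p, q


# (7, 3, 1): k - lam = 2 is not a square, so no rational t is exceptional.
print(solve_sums(7, 3, 1, 2))
# (21, 5, 1): k - lam = 4, so t = (5 + 2)/105 = 1/15 is exceptional.
print(solve_sums(21, 5, 1, Fraction(1, 15)))
```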

(ii) The proof of this result is analogous to the proof of (i). 

(iii) The number of lattice points in v dimensional space over the reals with 
components having absolute value less than M is finite. Hence if 𝒮′ contains 
more elements than the number of such lattice points then there exist t₁, 
t₂ ∈ 𝒮′, t₁ ≠ t₂, such that r_i(t₁) = r_i(t₂) for i = 1, 2, ..., v. The desired 
result follows from (ii). 

(iv) Once again as in (10) it may be shown that r_i(t) ≡ 0 mod k. Since 
r_i(t) > 0 it follows that 

\[
\Bigl(\sum_i r_i(t)\Bigr)^2 \ge \sum_i r_i^2(t) + v(v - 1)k^2.
\]

From the first of the equations given in (i) above it follows that 

\[
k^2 v(1 - t) \le (1 - t)\sum_i r_i^2(t).
\]

Since t > 1 we have Σ r_i²(t) ≤ k²v. But also 

\[
k^2 v^2 \le \Bigl(\sum_i r_i(t)\Bigr)^2 \le v \sum_i r_i^2(t).
\]

Hence Σ r_i²(t) = k²v and the proof may be completed as was the proof of (i). 

We remark that if v > k + 1 and t > 1 then r_i(t) ≠ 0 for i = 1, 2, ..., v. 
Theorem 3 gives sufficient conditions for the existence of a λ, k, v design in 
terms of integral solutions, which by Theorem 2 are known to exist, of the 
matric equation 

\[
X^T(I - tS)X = B - tk^2 S.
\]

The problem of determining the nature of these solutions appears to be 
extremely difficult. Also of interest, and possibly a more pliable problem, is the 
determination of the integral automorphs of I − tS and B − tk²S. 







REFERENCES 


1. A. A. Albert, Rational normal matrices satisfying the incidence equation, Proc. Amer. Math. 
   Soc., 4 (1953), 554-9. 
2. R. H. Bruck and H. J. Ryser, The nonexistence of certain finite projective planes, Can. J. 
   Math., 1 (1949), 88-93. 
3. S. Chowla and H. J. Ryser, Combinatorial problems, Can. J. Math., 2 (1950), 93-9. 
4. M. Eichler, Quadratische Formen und orthogonale Gruppen (Berlin, 1952). 
5. Marshall Hall and H. J. Ryser, Normal completions of incidence matrices, Amer. J. Math., 
   76 (1954), 581-9. 
6. B. W. Jones, A canonical quadratic form for the ring of 2-adic integers, Duke Math. J., 11 
   (1944), 715-27. 
7. B. W. Jones, The arithmetic theory of quadratic forms, Carus Math. Monographs, 10 (1950). 
8. W. Magnus, Ueber die Anzahl der in einem Geschlecht enthaltenen Klassen von positiv 
   definiten quadratischen Formen, Math. Ann., 114 (1937), 465-75. 
9. A. Mayer, Zürich naturf. Ges., 36 (1891), 241. 
10. H. J. Ryser, Matrices with integer elements in combinatorial investigations, Amer. J. Math., 
   74 (1952), 769-73. 
11. C. L. Siegel, Equivalence of quadratic forms, Amer. J. Math., 63 (1941), 658-80. 




Washington University 











LOOPS WITH ADJOINTS 
W. R. COWELL 


Introduction. It is shown in (6) how to represent certain sets of 
orthogonal Latin squares as a group together with a set of permutations of the 
group elements. The correspondence between 3-nets and loops is well known; 
for example, see (8). We shall consider a loop G together with a certain set 
of permutations on the elements of G and shall interpret such a structure as 
an incidence system in which the 3-net of the loop is embedded. Specifically, 
the permutations or “adjoints” will give rise to lines which may be adjoined 
to the 3-net of G in the sense of (3). The group of autotopisms of the loop 
determines a group of automorphisms of its 3-net analogous to collineations 
in an affine plane. We shall study the problem of extending these incidence 
preserving mappings to the adjoined lines. By analogy with the study of 
loops with operators, we shall consider homomorphisms of loops with adjoints 
and examine geometric consequences. Particular attention will be paid to the 
case where G has the inverse property and the adjoints are “linear.” The 
special case in which G is an abelian group is of geometric interest in that the 
corresponding incidence systems include the Veblen-Wedderburn affine planes. 


1. Nets, loops, and adjoints. A net N is a set of undefined objects 
called “points” and “lines,” together with a symmetric incidence relation 
(point on line, line through point), such that the lines can be partitioned 
into non-empty, disjoint subsets called “parallel classes” and the following 
incidence axioms hold: 

(i) Any point of N lies on exactly one line of each parallel class. 

(ii) Any pair of lines from distinct classes have exactly one point in 
common. 

(iii) There are at least three distinct parallel classes. 

An affine plane is a net which satisfies 

(iv) Given two distinct points of N, there is a unique line containing both 
of them. 

(v) There exists a set of four distinct points of N, no three of which lie on 
the same line. 

If a net N possesses a finite number k of parallel classes, one refers to N 
as a k-net. 

Suppose L is a set of points of the net N such that if M is any line of N, 
then L contains exactly one point of M. We say that L may be “adjoined as 
a line to N.” 

Received August 16, 1958. 

Let (G, +) be a loop. By the 3-net associated with G we mean the net N(G) 
whose points are ordered pairs (x, y), x and y in G, and whose three classes of 
lines are given by x = c, y = c, and y = x + c where c ranges over G and 
incidence means satisfaction of the equation. 

Let ι be the identity map on the loop G. An adjoint of G is a permutation 
σ on G for which there exists a permutation τ on G such that ι + τ = σ, 
where addition of mappings is defined by adding images. The permutation 
τ is a “complete mapping” in the sense of (6) and will be called the companion 
of σ. 


THEOREM 1. A line L can be adjoined to N(G) if and only if G possesses an 
adjoint σ. 


Proof. Suppose σ is an adjoint of G with companion τ. Define L to be the 
set of points (x, xσ), x ∈ G. If c ∈ G then L contains exactly the point (c, cσ) 
of the line x = c, the point (cσ⁻¹, c) of the line y = c, and the point (cτ⁻¹, 
cτ⁻¹σ) of the line y = x + c. Thus L may be adjoined to N(G). 

Conversely, let L be a set of points which is adjoined as a line to N(G). If 
(a, b) ∈ L, define aσ = b. Since L contains exactly one point from each line 
x = c and each line y = c, we see that σ is a permutation on G. Define the 
mapping τ of G into G by the equation aσ = a + aτ, a ∈ G. For each c ∈ G, 
the line y = x + c passes through exactly one point (a, aσ) of L. Hence τ is 
a permutation and σ is an adjoint of G. 
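
Theorem 1 can be illustrated on a small cyclic group. The Python sketch below is an added example under the assumption G = Z_n with addition modulo n, and the function names are hypothetical. It computes the companion τ of a permutation σ from x + xτ = xσ, tests whether σ is an adjoint, and checks that the point set {(x, xσ)} meets every line x = c, y = c, y = x + c of N(G) exactly once.

```python
from itertools import permutations


def is_permutation(mapping, n):
    """True if the list `mapping` is a permutation of 0..n-1."""
    return sorted(mapping) == list(range(n))


def companion(sigma, n):
    """The map tau determined by x + tau(x) = sigma(x) in Z_n."""
    return [(sigma[x] - x) % n for x in range(n)]


def is_adjoint(sigma, n):
    """sigma is an adjoint of Z_n if sigma and its companion are permutations."""
    return is_permutation(sigma, n) and is_permutation(companion(sigma, n), n)


def meets_each_line_once(sigma, n):
    """Check that L = {(x, sigma(x))} meets every line of N(Z_n) exactly once."""
    points = [(x, sigma[x]) for x in range(n)]
    for c in range(n):
        on_x = sum(1 for x, y in points if x == c)
        on_y = sum(1 for x, y in points if y == c)
        on_diag = sum(1 for x, y in points if y == (x + c) % n)
        if (on_x, on_y, on_diag) != (1, 1, 1):
            return False
    return True


DOUBLING = [(2 * x) % 5 for x in range(5)]   # x -> 2x on Z_5, companion x -> x
IDENTITY = list(range(5))                    # companion is the zero map

print(is_adjoint(DOUBLING, 5), meets_each_line_once(DOUBLING, 5))
print(is_adjoint(IDENTITY, 5), meets_each_line_once(IDENTITY, 5))
# Z_4 admits no adjoint at all, so no line can be adjoined to N(Z_4).
print(any(is_adjoint(list(s), 4) for s in permutations(range(4))))
```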

The incidence system consisting of N(G) together with the adjoined lines 
associated with a set Σ of adjoints of G will be called the quasinet Q(G, Σ). 

A set Σ of adjoints of G is said to be compatible if, for every distinct pair 
σ₁, σ₂ in Σ, there is at most one x ∈ G such that xσ₁ = xσ₂; that is, the 
corresponding lines of Q(G, Σ) share at most one point. 


2. Adjoints under isotopy. The loops (G, +) and (G, ⊕) defined on 
the same set G are said to be isotopic if there exists an ordered triple (α, γ, β) 
of permutations of G such that xα ⊕ yγ = (x + y)β. We write (G, +) = 
(α, γ, β)(G, ⊕). If ⊕ is the same operation as + then (α, γ, β) is called an 
autotopism of (G, +). Isotopy is an equivalence relation on the set of all 
loops and the autotopisms of a loop form a group which contains the 
automorphism group of G. We refer the reader to (2) for a discussion of the 
algebraic properties of isotopy and autotopy. 

A homomorphism of a net N onto a net N′ is a mapping of points onto 
points and lines onto lines which preserves incidence and parallelism and is 
one-one on classes of lines. If the homomorphism is one-one on points and 
lines, it is called an isomorphism. 

The proofs of the following two well-known theorems may essentially be 
found in (8). 













THEOREM 2. The mapping 


(x, y) → (xα, yβ) 
[x = c] → [x = cα] 
[y = c] → [y = cβ] 
[y = x + c] → [y = x ⊕ cγ] 


is an isomorphism of N(G, +) onto N(G, ⊕) if and only if (G, +) = (α, γ, β) 
(G, ⊕). 


THEOREM 3. Let N be an arbitrary 3-net and let O be a point of N. Let the 
designations x = c, y = c, and y = x + c be assigned to the three classes. Then, 
for some appropriate loop G, N = N(G), O is the point (0,0), and the three 
classes are as designated. Moreover, if a different point O' is chosen and the class 
designation remains the same, then N = N(G') for some G' isotopic to G. 


It is clear that isomorphic nets have corresponding adjoined lines; thus 
isotopic loops have corresponding adjoints. The correspondence is given in 
the proof of 


THEOREM 4. Let (G, +) be a loop with an adjoint σ. Suppose (G, +) = (α, 
γ, β)(G, ⊕). Then (G, ⊕) possesses a unique adjoint σ′ so that the isomorphism 
of Theorem 2 can be extended to an incidence preserving mapping of the associated 
adjoined lines. 


Proof. The set of images of the points (x, xσ), x ∈ G is a line adjoined 
to N(G, ⊕) if and only if (xα, xσβ) = (xα, xασ′) for each x where σ′ is an 
adjoint of (G, ⊕). If the condition holds, then σ′ = α⁻¹σβ and we show that 
this σ′ is an adjoint for (G, ⊕). Let σ have companion τ. Then, for all x ∈ G, 


x(ι ⊕ α⁻¹τγ) = xα⁻¹α(ι ⊕ α⁻¹τγ) = (xα⁻¹)α ⊕ (xα⁻¹τ)γ = 
(xα⁻¹ + xα⁻¹τ)β = xα⁻¹(ι + τ)β = xα⁻¹σβ. 

Thus the companion of α⁻¹σβ is α⁻¹τγ. 


If V = (α, γ, β) is an autotopism of a loop G and Σ is a set of adjoints of 
G, then we say that V is extensible relative to Σ if α⁻¹Σβ = Σ. From the proof 
of Theorem 4, we see that the extensible autotopisms are exactly those for 
which the automorphism of N(G) given by Theorem 2 induces a line onto 
line, incidence preserving mapping of Q(G, Σ). 

An example of an extensible autotopism is furnished by a projective plane 
with a collineation which leaves fixed every point of some line l. Let l be 
used as the line at infinity and construct a co-ordinate system as in (4). Take 
G to be the additive loop of the ternary and let Σ be the set of mappings 
x → x·m∘b where m assumes some set of values exclusive of 0, 1, ∞, and 
where b takes on all values from the ternary for each m. The collineation then 
induces an automorphism of N(G) corresponding to an autotopism of G which 
is extensible relative to Σ by virtue of the fact that adjoined lines are mapped 
into adjoined lines. 













3. Homomorphisms. Let H be a normal subloop of G and let η be the 
natural homomorphism of G onto G/H. Then we may define a homomorphism 
of N(G) onto N(G/H) by a mapping of the same form as that used in Theorem 
2 with α = β = γ = η and ⊕ taken as +, the operation in G/H. 
Furthermore, it follows easily from Theorem 3 that every homomorphism of a 3-net 
onto a 3-net can be viewed in this way for appropriate G and H. 

Suppose σ is an adjoint of G with companion τ. A normal subloop H of G 
is a σ-subloop provided any one of the following implies the other two: (i) 
x ≡ y mod H, (ii) xσ ≡ yσ mod H, (iii) xτ ≡ yτ mod H. If Σ is a set of 
adjoints, then H is a normal Σ-subloop if H is a normal σ-subloop for each 
σ ∈ Σ. This definition states that σ* defined by (x + H)σ* = xσ + H is a 
permutation of G/H. The same statement applies to τ* and, moreover, 
(x + H)(ι* + τ*) = (x + H) + (x + H)τ* = (x + H) + (xτ + H) = 
(x + xτ) + H = xσ + H = (x + H)σ* for all x and thus σ* is an adjoint 
of G/H. Furthermore, the point (x, xσ) maps into (x + H, (x + H)σ*) so the 
adjoined line of N(G) defined by σ maps onto the adjoined line of N(G/H) 
defined by σ*. 

Suppose, conversely, that H is normal in G, that σ is an adjoint of G, and 
that the images (x + H, xσ + H) form a line adjoined to N(G/H). Then 
the mapping x + H → xσ + H is an adjoint of G/H. Hence, x + H = y + H 
if and only if xσ + H = yσ + H. Also, if x′ + H is the unique solution of 
(x + H) + (x′ + H) = xσ + H, then x + H → x′ + H is a permutation of 
G/H so x + H = y + H if and only if x′ + H = y′ + H. But xσ + H = 
(x + x′) + H = (x + xτ) + H so x′ + H = xτ + H. Therefore, H is a 
normal σ-subloop of G. 


4. Linear adjoints and extensibility. If a ∈ G, the permutation ρ(a) 
is defined by xρ(a) = x + a. A permutation of G is linear if it has the form 
σ = δρ(a) where δ is an automorphism of G. We say that σ is strongly linear 
if a is in the associator (see 2) of G. An adjoint σ of G is linear (strongly linear) 
if both σ and its companion are linear (strongly linear) as permutations. If a 
linear adjoint σ has a = 0, σ is an automorphism adjoint. 


LEMMA 1. (i) If σ = δρ(a) is a linear adjoint with companion γρ(b) then 
a = b. If σ is strongly linear, then δ is a strongly linear adjoint with companion γ. 

(ii) If σ is any adjoint on G with companion τ and a is in the associator of G 
then σρ(a) is an adjoint with companion τρ(a). 


Proof. (i) 0(ι + γρ(b)) = 0δρ(a) gives a = b. Under strong linearity, 
xδ + a = x + (xγ + a) = (x + xγ) + a for every x ∈ G and thus δ = ι + γ. 

(ii) x(ι + τρ(a)) = x + (xτ + a) = (x + xτ) + a = xσρ(a). 

A set Σ of strongly linear adjoints of G will be called complete if, for every 
automorphism δ such that δρ(a) ∈ Σ, δρ(b) ∈ Σ for every b in the associator 
of G. Lemma 1 guarantees that δρ(b) is an adjoint. 

Bruck (2, ch. m, § 4) has studied three types of autotopisms of a com- 













mutative loop G with the inverse property and has shown that these auto- 
topisms generate the autotopism group of G. The following theorem shows that 
certain subclasses of types (1) and (2) and all autotopisms of type (3) are 
extensible relative to a complete set of strongly linear adjoints. 


THEOREM 5. Suppose G is a commutative loop with the inverse property and Σ 
is a complete set of strongly linear adjoints of G. 

(i) If U = (α, α, α) where α is an automorphism of G, then U is extensible 
relative to Σ if and only if α⁻¹Δα = Δ where Δ is the set of automorphisms in Σ. 

(ii) If V = (ρ(a), ρ(a), ρ(a)²) where a is in the associator of G, then V and 
all the autotopisms obtainable from V by Bruck's Lemma 4A are extensible 
relative to Σ. 

(iii) If W = (ι, ρ(a), ρ(a)) where ι is the identity and a is in the associator 
of G, then W and all the autotopisms obtainable from W by Bruck's Lemma 
4A are extensible relative to Σ. 


Proof. (i) Suppose U is extensible. Let δρ(b) ∈ Σ. Then, for all x ∈ G, 
xα⁻¹δρ(b)α = xα⁻¹δα + bα = xγ + c for some γ ∈ Δ, c an associator element. 
If x = 0, bα = c and α⁻¹δα = γ. Similarly αΔα⁻¹ ⊂ Δ. 

Conversely, assume α⁻¹Δα = Δ. Then, if x ∈ G, xα⁻¹δρ(a)α = x(α⁻¹δα)ρ(aα) 
and xαδρ(a)α⁻¹ = x(αδα⁻¹)ρ(aα⁻¹). Since the associator is a characteristic 
subloop of G, U is extensible. 

(ii) Consider xρ(a)⁻¹δρ(b)ρ(a)² = [(x − a)δ + b] + 2a = xδρ(c) where c 
is in the associator. Similarly, ρ(a)δρ(b)ρ(a)⁻² ∈ Σ. 

Extensibility for the autotopisms obtained by using Bruck's Lemma 4A 
may be verified with a certain amount of similar computation. 

(iii) We compute xι⁻¹δρ(b)ρ(a) = (xδ + b) + a = xδρ(c) and xιδρ(b)ρ(a)⁻¹ 
= (xδ + b) − a = xδρ(d) where c and d are in the associator. 

Again, it is straightforward to verify that the “derived” autotopisms are 
extensible. 


THEOREM 6. Let G be an abelian group and let Σ be a complete set of (strongly) 
linear adjoints of G. If Δ is the set of automorphisms in Σ, let δ₁ − δ₂ be a 
permutation (and hence an automorphism) of G for every pair δ₁, δ₂ ∈ Δ, δ₁ ≠ δ₂. 
Then, the quasinet Q(G, Σ) is a net each of whose parallel classes, besides those 
of N(G), consists of the set of adjoined lines given by the adjoints δρ(c) where δ 
is fixed and c ranges over G. Moreover, the automorphisms of Q(G, Σ) are 
exactly given by (x, y) → (xα + r, yα + s) where α is an automorphism of G 
in the centralizer of Δ and r, s are in G. 


Proof. The point (a, b) ∈ Q(G, Σ) is on exactly the line determined by 
δρ(c), c = b − aδ, in the class corresponding to δ. Any line from an adjoined 
class contains exactly one point in common with each line of N(G) and thus 
we need only consider lines determined by δ₁ρ(a), δ₂ρ(b) where δ₁ ≠ δ₂. It is 
easy to see that these lines share exactly the point (x, y) where x = (b − a) 
(δ₁ − δ₂)⁻¹, y = xδ₁ρ(a) = xδ₂ρ(b). Hence Q(G, Σ) is a net. 
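
The first part of Theorem 6 can be seen in the simplest setting. The Python sketch below is an added illustration under the assumption that G = Z_p is the cyclic group of prime order p and that Δ consists of the multiplications x → mx for 2 ≤ m < p, so that Σ = {x → mx + c}; the function names are hypothetical. It builds the parallel classes of Q(G, Σ) and tests the net axioms by exhaustive enumeration.

```python
from itertools import combinations


def quasinet_classes(p):
    """Points and parallel classes of Q(Z_p, Sigma), Sigma = {x -> m*x + c, m >= 2}."""
    points = [(x, y) for x in range(p) for y in range(p)]
    classes = [
        [frozenset((c, y) for y in range(p)) for c in range(p)],   # lines x = c
        [frozenset((x, c) for x in range(p)) for c in range(p)],   # lines y = c
    ]
    # m = 1 gives the third class y = x + c of N(G); m >= 2 gives adjoined lines.
    for m in range(1, p):
        classes.append(
            [frozenset((x, (m * x + c) % p) for x in range(p)) for c in range(p)]
        )
    return points, classes


def is_net(points, classes):
    """Axioms (i) and (ii): one line per class through each point, and
    lines from distinct classes meet in exactly one point."""
    for cls in classes:
        for pt in points:
            if sum(1 for line in cls if pt in line) != 1:
                return False
    for cls_a, cls_b in combinations(classes, 2):
        for line_a in cls_a:
            for line_b in cls_b:
                if len(line_a & line_b) != 1:
                    return False
    return len(classes) >= 3


POINTS, CLASSES = quasinet_classes(5)
print(len(CLASSES), is_net(POINTS, CLASSES))
```

For p = 5 this yields p + 1 = 6 parallel classes, the number found in an affine plane of order 5. Running the same construction with the non-prime modulus 4 fails the net test, since x → 2x is then not a permutation.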




Every automorphism of the net Q(G, Σ) induces an automorphism of N(G) 
which, by virtue of Theorem 2, corresponds to an autotopism of G. By Bruck's 
Theorem 4D (2, ch. II, § 4), this autotopism is of the form UVW where 
U, V, and W are of types (i), (ii), (iii) respectively, as described in Theorem 5. 
It is easy to check that V and W and hence VW correspond to “translations”: 
(x, y) → (x + r, y + s). Thus, the autotopism UVW corresponds to (x, y) → 
(xα + r, yα + s) where α is an automorphism of G. Moreover, UVW is 
extensible in that it gives an automorphism of Q(G, Σ). Furthermore, by 
Theorem 5 (ii), (iii), VW is extensible and thus ρ(r)Σρ(s)⁻¹ = Σ. Therefore, 


αΣα⁻¹ = α[ρ(r)Σρ(s)⁻¹]α⁻¹ = [αρ(r)]Σ[αρ(s)]⁻¹ = Σ, 


showing that (α, α, α) is extensible and giving, by Theorem 5 (i), αΔα⁻¹ = Δ. 
But, even more, the translations, the automorphisms of Q(G, Σ), and hence 
also the mappings (x, y) → (xα, yα) preserve parallel class. If we note the 
proof of Theorem 5 (i), we see that α⁻¹δα = δ for each δ ∈ Δ. 

To prove the converse, suppose we have a mapping of Q(G, Σ) as described 
in the statement of the theorem. The translations are automorphisms of N(G) 
and the corresponding autotopism is a product of autotopisms of types (ii) 
and (iii): 

(ρ(r), ρ(s − r), ρ(s)) = (ρ(r), ρ(r), ρ(r)²)(ι, ρ(s − 2r), ρ(s − 2r)). 


Thus the translations correspond exactly to such products and, by Theorem 5 
(ii), (iii), are extensible. In fact, they preserve parallel class. Also, by the 
proof of Theorem 5 (i), (α, α, α) is extensible and preserves parallel class in 
the extended net. Thus our mapping corresponds to the autotopism (αρ(r), 
αρ(s − r), αρ(s)) which is extensible and preserves parallel class and therefore 
gives an automorphism of Q(G, Σ). 

The translations (x, y) → (x + r, y + s) are transitive on the points of 
Q(G, Σ) and we see that if Q(G, Σ) is an affine plane, it is Veblen-Wedderburn (8). 


5. Linear adjoints and homomorphisms. 


LEMMA 2. Let G be a loop and σ = δρ(c) be a linear adjoint with companion 
γρ(c). Then a normal subloop H of G is a σ-subloop if and only if Hδ = Hγ = H. 


Proof. Assume Hδ = H. Then xδ + c = (yδ + c) + h for h ∈ H if and 
only if xδ + c = (yδ + h′) + c for h′ ∈ H if and only if xδ = yδ + h′. But 
yδ + h′ = yδ + h″δ = (y + h″)δ for h″ ∈ H. Thus xδρ(c) = yδρ(c) + h if 
and only if x = y + h″. Similarly, Hγ = H implies x ≡ y mod H if and 
only if xγρ(c) ≡ yγρ(c) mod H. 

Now assume that H is a normal σ-subloop. Then h ≡ 0 mod H if and only 
if hδρ(c) ≡ 0δρ(c) mod H if and only if hδ + c ≡ c mod H if and only if 
hδ ≡ 0 mod H; that is, h ∈ H if and only if hδ ∈ H. Similarly, Hγ = H. 

We remark that Hδ ⊂ H if and only if Hγ ⊂ H; for, if h ∈ H, hδ + c = h 
+ (hγ + c) = (h′ + hγ) + c for some h′ ∈ H. Hence hδ = h′ + hγ and thus 
hδ ∈ H if and only if hγ ∈ H. Therefore, if G is an abelian group with 
descending chain condition or if G is a finite loop, we need only assume either Hδ ⊂ H 
or Hγ ⊂ H in order to prove that a normal subloop H is a normal σ-subloop. 

In any loop $G$, multiplication of mappings of $G$ into $G$ is left distributive
over addition of mappings. If $G$ has the inverse property, one readily verifies
that the mappings of $G$ into $G$ form a loop (under addition) with the inverse
property where $-\alpha$ is defined by $x(-\alpha) = -(x\alpha)$. Therefore,
$-(\alpha + \beta) = (-\beta) + (-\alpha)$. Also, $-\alpha$ is one-one (onto) if and only if $\alpha$ is
one-one (onto).


LEMMA 3. Let $G$ be a loop with the inverse property and let $\sigma_1 = \delta_1\rho(c_1)$ and
$\sigma_2 = \delta_2\rho(c_2)$ be strongly linear adjoints on $G$ where $\delta_1 - \delta_2$ is one-one on $G$. Then

(i) $\sigma_1$ and $\sigma_2$ are compatible,

(ii) if $H$ is a normal $\{\sigma_1, \sigma_2\}$-subloop, the induced adjoints $\sigma^*_1$ and $\sigma^*_2$ are
strongly linear on $G/H$. If $H \subseteq H(\delta_1 - \delta_2)$, then $\delta^*_1$ and $\delta^*_2$ are compatible.


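Proof. (i) Assume $x\delta_1 + c_1 = x\delta_2 + c_2$. Then $-(x\delta_2) + (x\delta_1 + c_1) = c_2$
and $-(x\delta_2) + x\delta_1 = x(-\delta_2 + \delta_1) = c_2 - c_1$. The solution is unique if
$-\delta_2 + \delta_1$ is one-one. Since $\delta_1 - \delta_2$ is one-one so also is $-(\delta_1 - \delta_2) = \delta_2 - \delta_1$.
Define the mapping $\theta$ on $G$ by $x\theta = -x$. We see that $\theta$ is a permutation of
$G$ and that if $\alpha$ is an automorphism of $G$, then $\theta\alpha = -\alpha$. Therefore,

$$\theta(\delta_2 - \delta_1) = \theta\delta_2 + \theta(-\delta_1) = -\delta_2 + \theta(\theta\delta_1) = -\delta_2 + (\theta\theta)\delta_1 = -\delta_2 + \delta_1$$

is one-one. We note that if $\delta_1 - \delta_2$ is a permutation then $x\delta_1 + c_1 = x\delta_2 + c_2$
possesses a (unique) solution.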

(ii) For $i = 1, 2$, $(x + H)\sigma^*_i = x\sigma_i + H = (x\delta_i + c_i) + H = (x\delta_i + H) + (c_i + H)$.
Clearly, $c_i + H$ is in the associator of $G/H$ and, by Lemma 2,
$\delta^*_i : x + H \to x\delta_i + H$ is an automorphism of $G/H$. In the same way, the
companion of $\sigma^*_i$ is a strongly linear permutation and thus $\sigma^*_i$ is a strongly
linear adjoint.

Next, assume $(x + H)\delta^*_1 = (x + H)\delta^*_2$. Then, for some $h \in H$, $x\delta_1 = h + x\delta_2$
and thus $x\delta_1 - x\delta_2 = x(\delta_1 - \delta_2) = h = h'(\delta_1 - \delta_2)$ for some $h' \in H$.
Since $\delta_1 - \delta_2$ is one-one, $x \in H$ and hence $0 + H$ is the unique solution of
$X\delta^*_1 = X\delta^*_2$.


THEOREM 7. Let $G$ be an abelian group satisfying the descending chain condition.
If $\Sigma$ is a complete set of compatible linear adjoints and $H$ is a $\Sigma$-subgroup,
then the quasinets $Q(G, \Sigma)$ and $Q(G/H, \Sigma^*)$ are nets in the sense described
in Theorem 6.


Proof. Let $\Delta$ be the set of automorphisms in $\Sigma$ and suppose $\delta_1$ and $\delta_2$ are
distinct elements of $\Delta$. Now $\delta_1 - \delta_2$ is an endomorphism of $G$ and $x(\delta_1 - \delta_2) = 0$
implies $x\delta_1 = x\delta_2$. Since $\delta_1$ and $\delta_2$ are compatible and since $0\delta_1 = 0\delta_2$ we see
that $x = 0$; that is, the kernel of $\delta_1 - \delta_2$ is 0 and $\delta_1 - \delta_2$ is one-one and has
a right inverse. By the descending chain condition, $G(\delta_1 - \delta_2)^{r+1} = G(\delta_1 - \delta_2)^r$
for some $r$ and thus $G(\delta_1 - \delta_2) = G$ showing, by Theorem 6, that $Q(G, \Sigma)$
is a net.





The set $\Sigma^*$ of induced adjoints of $G/H$ is certainly complete. Let $\delta^*_1$ and
$\delta^*_2$ be distinct elements of $\Delta^*$. Then $\delta^*_1 - \delta^*_2$ is an endomorphism of $G/H$
and $(x + H)(\delta^*_1 - \delta^*_2) = H$ implies $(x + H)\delta^*_1 - (x + H)\delta^*_2 = (x\delta_1 - x\delta_2) + H
= x(\delta_1 - \delta_2) + H = H$. Thus $x(\delta_1 - \delta_2) \in H$. But $H(\delta_1 - \delta_2) \subseteq H$
because $H$ is a $\Sigma$-subgroup and, by the descending chain condition, there is
an integer $s$ such that $H(\delta_1 - \delta_2)^{s+1} = H(\delta_1 - \delta_2)^s$. As above, $H(\delta_1 - \delta_2) = H$
giving $x \in H$ and showing that $\delta^*_1 - \delta^*_2$ is an isomorphism. Since $G/H$ satisfies
the descending chain condition, $Q(G/H, \Sigma^*)$ is a net as before.

If $\Sigma$ is a set of strongly linear adjoints on a loop $G$, we have seen that the
set $\Delta$ of automorphisms in $\Sigma$ plays a special role in identifying normal
$\Sigma$-subloops and in questions of compatibility and extensibility. We shall focus
attention now on $\Delta$ and the normal $\Delta$-subloops, realizing that, for every
$\delta \in \Delta$ and every $a$ in the associator of $G$, there is an adjoint $\delta\rho(a)$ whose
normal subloops are exactly the normal $\delta$-subloops.


LEMMA 4. Let $G$ be a finite loop of order $n > 1$ and suppose $\Delta$ is a set of
automorphism adjoints such that every pair in $\Delta$ is compatible. Then the number
of elements in $\Delta$ is not more than $n - 2$.


Proof. The lines $x = 0$, $y = 0$, and $y = x$ of $N(G)$ have exactly the point
$(0,0)$ in common. Further, $(0,0)$ is the point shared by each of these three
lines and the adjoined line determined by $\delta \in \Delta$. Moreover, by compatibility,
every pair of adjoined lines shares exactly this point. Let $d$ be the number
of elements in $\Delta$. $N(G)$ has $n^2$ points and we count the points on the above
$d + 3$ lines:

$$(d + 3)(n - 1) + 1 \le n^2.$$

Rejecting $n = 1$, we have $d \le n - 2$.
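The last step can be unpacked as follows. This is only a restatement of the count above, with $d$ the number of adjoints and $n$ the order of $G$: each of the $d + 3$ lines through $(0,0)$ contributes $n - 1$ further points, and $n^2 - 1$ factors as $(n-1)(n+1)$.

```latex
\[
(d+3)(n-1) + 1 \le n^2
\;\Longleftrightarrow\;
(d+3)(n-1) \le (n-1)(n+1)
\;\Longleftrightarrow\;
d \le n - 2 \qquad (n > 1).
\]
```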


Definition. A compatible triple $(G, \Delta, H)$ consists of a loop $G$ with the inverse
property, a set $\Delta$ of automorphism adjoints on $G$ and a normal $\Delta$-subloop
$H$ of $G$ such that, for every distinct pair $\delta_1$ and $\delta_2$ in $\Delta$, $\delta_1 - \delta_2$ is one-one on
$G$ and is a permutation on $H$. The degree of $\Delta$ ($\deg \Delta$) is the number of
elements in $\Delta$. $(G : H)$ denotes the index of $H$ in $G$.


THEOREM 8. If $(G, \Delta, H)$ is a compatible triple where $\Delta$ has finite degree, then
either $(G : H) = 1$ or $(G : H) \ge \deg \Delta + 2$.

Proof. Suppose $(G : H) > 1$. By Lemma 3 (ii), $G/H$ is a loop with automorphism
adjoints $\Delta^*$ and every pair $\delta^*_1$, $\delta^*_2$ from $\Delta^*$ is compatible. Moreover,
$\deg \Delta = \deg \Delta^*$ for suppose $(x + H)\delta^*_1 = (x + H)\delta^*_2$ for all $x$. Then,
for each $x \in G$, there is an $h \in H$ such that $x\delta_1 = h + x\delta_2$ or $x(\delta_1 - \delta_2) = h$.
But $\delta_1 - \delta_2$ is a permutation on $H$ and hence $x \in H$, contradicting our assumption
that $G \ne H$. If $(G : H)$ is finite, apply Lemma 4 to the loop $G/H$ with
adjoints $\Delta^*$ to obtain $\deg \Delta = \deg \Delta^* \le (G : H) - 2$.


COROLLARY 1. If the inverse property loop $G$ contains a characteristic normal
subloop $H$ of index 2 then $G$ has no automorphism adjoints.

Proof. If $\delta$ is an automorphism adjoint, take $\Delta = \{\delta\}$ so $\deg \Delta = 1$. $(G, \Delta, H)$
is a compatible triple and thus $2 \ge 1 + 2$, a contradiction.


Example. The symmetric group on m symbols has no automorphism adjoints. 


COROLLARY 2. Let $G$ be a finite loop with the inverse property and let $E$ be a
characteristic normal subloop of $G$. Then, if $(G, \Delta, H)$ is a compatible triple for
any $H$, $\deg \Delta \le (G : E) - 2$.

Proof. $(G, \Delta, E)$ is a compatible triple because $E$ is a $\Delta$-subloop since it is
characteristic and, by finiteness, $\delta_1 - \delta_2$ is a permutation on $E$ for every
distinct pair from $\Delta$.


6. Irreducible sets of linear adjoints. 


Definition. Let $G$ be a loop with a set $\Sigma$ of adjoints. We say that $\Sigma$ is
irreducible if $G$ has no proper normal $\Sigma$-subloops.

THEOREM 9. Let $G$ be a loop with a set $\Sigma$ of adjoints so that the quasinet $Q(G, \Sigma)$
is an affine plane. Then $\Sigma$ is irreducible.


Proof. Given $x \ne 0$, $y \ne 0$, $x \ne y$ in $G$, there is a unique line of the plane
through $(0, 0)$ and $(x, y)$. If the line is in $N(G)$, either $x = 0$, $y = 0$, or $y = x$
but these are excluded. Therefore, there exists $\sigma \in \Sigma$ such that $0\sigma = 0$ and
$x\sigma = y$. Now suppose $H$ is a proper normal $\Sigma$-subloop of $G$. Choose $x = h \ne 0$
in $H$ and $y \notin H$. Then $h \equiv 0 \bmod H$ implies $h\sigma \equiv 0\sigma = 0 \bmod H$ and thus
$h\sigma = y$ is in $H$, a contradiction.


LEMMA 5. Let $G$ be a finite loop with an irreducible set $\Sigma$ of strongly linear
adjoints. Let $\Delta$ be the set of automorphisms belonging to the adjoints of $\Sigma$ and
let $\Omega$ be the centralizer of $\Delta$ in the semigroup of endomorphisms of $G$. Then the
non-zero elements of $\Omega$ are automorphisms. If $G$ is an abelian group, the finiteness
restriction can be dropped.


Proof. Let $K$ be the kernel of $\omega \in \Omega$. Then $k \in K$, $\delta \in \Delta$ implies
$(k\delta)\omega = (k\omega)\delta = 0\delta = 0$ and thus $K\delta \subseteq K$. But $\omega$ commutes also with $\delta^{-1}$
giving $K\delta = K$. Now $\varepsilon + \gamma = \delta$ and since $\omega$ is an endomorphism, multiplication
by $\omega$ is distributive on both sides over mapping addition. Thus,
$\omega + \gamma\omega = \delta\omega$ and $\omega + \omega\gamma = \omega\delta$ showing $\omega\gamma = \gamma\omega$. As before, $K\gamma = K$. By
Lemma 2 and irreducibility, $K = G$ or $K = 0$ and hence $\omega$ is the zero endomorphism
or is one-one on $G$. In the latter case, since $G$ is finite, $\omega$ is an
automorphism. If $G$ is an infinite abelian group, $G\omega = (G\delta)\omega = (G\omega)\delta$ and
$G\omega = (G\gamma)\omega = (G\omega)\gamma$ showing that $G\omega$ is a (normal) $\Sigma$-subloop. Therefore,
either $G\omega = 0$, in which case $G = 0$, or $G\omega = G$. In either case $\omega$ is an
automorphism of $G$.

THEOREM 10. Let $G$ be a finite loop or an abelian group with an irreducible
complete set $\Sigma$ of strongly linear adjoints whose subset of automorphisms is $\Delta$.
Let $\Omega$ be the centralizer of $\Delta$ in the endomorphisms of $G$. Then the autotopism
$(\omega, \omega, \omega)$ with $\omega \in \Omega$, $\omega$ not zero, is associated with an automorphism of $N(G)$
which leaves fixed every line in $Q(G, \Sigma)$ through the point $(0,0)$. Conversely,
every such automorphism of $N(G)$ is paired with an autotopism of the type
described. Moreover, these autotopisms are extensible relative to $\Sigma$.

Proof. The automorphism associated with $(\omega, \omega, \omega)$ is $(x, y) \to (x\omega, y\omega)$.
This obviously fixes the lines $x = 0$, $y = 0$, and $y = x$ of $N(G)$. If $\delta\rho(a) \in \Sigma$
determines an adjoined line through the origin, evidently $a = 0$ and
$(x, x\delta) \to (x\omega, x\delta\omega) = (x\omega, (x\omega)\delta)$. Thus the line determined by $\delta$ maps into
itself.

Conversely, let $(\alpha, \gamma, \beta)$ give an automorphism which fixes lines through
$(0,0)$. Then, in particular, $y = x$ is fixed and hence $(x, x) \to (x\alpha, x\beta)$ for all
$x$ implies $\alpha = \beta$. Therefore, $(x + y)\alpha = x\alpha + y\gamma$ for every $x$ and $y$. Since
$(0, 0)$ is fixed, $0\alpha = 0$ and we set $x = 0$ giving $y\alpha = y\gamma$. Therefore, the
autotopism arises from an automorphism, $\alpha = \beta = \gamma$ of $G$. If $\delta \in \Delta$, the
corresponding line is fixed and hence $(x, x\delta) \to (x\alpha, x\delta\alpha) = (x\alpha, (x\alpha)\delta)$ showing that
$\alpha$ is in the centralizer of $\Delta$.

Extensibility follows immediately and, in fact, if $G$ and $\Sigma$ satisfy the
hypothesis of Theorem 6, $\alpha$ gives an automorphism of the quasinet $Q(G, \Sigma)$.

Suppose $G$ is an abelian group and $\Sigma$, $\Delta$, $\Omega$ are as in Theorem 10. Then $\Omega$
is a ring and, in fact, a division ring by Lemma 5. Therefore, we may regard
$G$ as a vector space over $\Omega$ and we note that the elements of $\Omega$ different from
the zero and the identity are automorphism adjoints of $G$. If $Q(G, \Sigma)$ is a
Veblen-Wedderburn plane as in Theorem 6, one may verify that $\Omega$ is André's
"Kern" (1).


7. A class of examples. The neofields of Paige (7) and their generalizations,
the division neorings of Hughes (5), provide a class of examples of
loops with adjoints. If $(D, +, \cdot)$ is a division neoring (see (5) for definition),
take $G$ to be $(D, +)$ and define the mapping $R_a$ on $G$ by $xR_a = xa$ where
$a \in D$. Clearly, $R_a$ is a permutation for every $a \ne 0$. Then, every $R_a$ with
$a \ne 0$, $a \ne 1$ is an automorphism adjoint on $G$ because $(x + y)R_a = xR_a + yR_a$
for every $a \in D$ and $x(\varepsilon + R_b) = x + xb = x(1 + b) = xR_a$ where $b \ne 0$ is
the unique solution of $1 + b = a$. If $E$ is any subdivision neoring for which
$(E, +)$ is normal in $(D, +)$, then $(E, +)$ is a normal $\Delta$-subloop where $\Delta$
consists of the mappings $R_e$, $e \in E$, $e \ne 0$, $e \ne 1$. Moreover, if $(D, +)$ has
the inverse property, then $((D, +), \Delta, (E, +))$ is a compatible triple.
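As a small sanity check of this class of examples, the sketch below takes the simplest case one can write down. It assumes (this is not stated in the text) that the prime field $\mathbb{Z}_p$ may serve as the division neoring $D$; the names `right_mult` and `is_permutation` are illustrative only. It checks that for $a \ne 0, 1$ both $R_a$ and its companion $R_b$ with $1 + b = a$ are permutations, that $R_a - R_{a'}$ is one-one for distinct $a, a'$, and that the number of such maps equals the bound $n - 2$ of Lemma 4.

```python
# Sketch under the assumption D = Z_p (a field), G = (Z_p, +).
p = 7
G = list(range(p))

def right_mult(a):
    """The mapping R_a : x -> x*a on Z_p, stored as a list indexed by x."""
    return [(x * a) % p for x in G]

def is_permutation(mapping):
    """True if the list `mapping` is a rearrangement of G."""
    return sorted(mapping) == G

# Candidate automorphism adjoints: R_a with a != 0, 1.
delta = {a: right_mult(a) for a in range(2, p)}

# R_a and its companion R_b (with 1 + b = a) are both permutations.
companions_ok = all(
    is_permutation(delta[a]) and is_permutation(right_mult(a - 1))
    for a in delta
)

# For distinct a, a2 the difference R_a - R_a2 is one-one on G.
differences_ok = all(
    is_permutation([(delta[a][x] - delta[a2][x]) % p for x in G])
    for a in delta for a2 in delta if a != a2
)

degree = len(delta)
print(companions_ok, differences_ok, degree, p - 2)
```

In this special case the $d + 3 = p + 1$ lines through $(0,0)$ fill the whole plane, so the bound of Lemma 4 is attained.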











REFERENCES

1. Johannes André, Ueber nicht-Desarguessche Ebenen mit transitiver Translationsgruppe, Math. Zeitschr., 60 (1954), 156-186.
2. R. H. Bruck, Contributions to the theory of loops, Trans. Amer. Math. Soc., 60 (1946), 245-354.
3. R. H. Bruck, Finite nets, I. Numerical invariants, Can. J. Math., 3 (1951), 94-107.
4. Marshall Hall, Projective planes, Trans. Amer. Math. Soc., 54 (1943), 229-277.
5. D. R. Hughes, Planar division neo-rings, Trans. Amer. Math. Soc., 80 (1955), 502-527.
6. Henry B. Mann, The construction of orthogonal Latin squares, Ann. Math. Stat., 13 (1942), 418-423.
7. Lowell J. Paige, Neofields, Duke Math. J., 16 (1949), 39-60.
8. Günter Pickert, Projektive Ebenen (Berlin, 1955).


Montana State University 




















ON QUADRUPLE SYSTEMS 
HAIM HANANI 


1. Introduction. Given a set $E$ of $n$ elements we denote by $S(l, m, n)$,
$(l \le m \le n)$, a system¹ of subsets of $E$, having $m$ elements each, such that
every subset of $E$ having $l$ elements is contained in exactly one set of the
system $S(l, m, n)$.

It is clear (3), that a necessary condition for the existence of $S(l, m, n)$ is
that

$$(1)\qquad \binom{n-h}{l-h} \Big/ \binom{m-h}{l-h} = \text{integer}, \qquad (h = 0, 1, \ldots, l-1).$$

Here $\binom{n}{l} \big/ \binom{m}{l}$ is the number of elements of $S(l, m, n)$ and
$\binom{n-h}{l-h} \big/ \binom{m-h}{l-h}$ is the number of those elements of $S(l, m, n)$ which
contain $h$ fixed elements of $E$.

It is known that condition (1) is not sufficient for $S(l, m, n)$ to exist. It
has been proved that no finite projective geometry exists with 7 points on
every line.² This implies non-existence of $S(2, 7, 43)$.

There arises a problem of finding a necessary and sufficient condition for
the existence of $S(l, m, n)$, or more precisely, of finding, for given values of
$l$ and $m$, all values of $n$ for which $S(l, m, n)$ exists.

Received February 9, 1959.

¹The term "family" would be more appropriate, but for historical reasons we shall use the
term "system."

²This follows from the proof by G. Tarry that the "36 Offiziere" problem of Euler has no
solution (8).

Already in 1852 Steiner (6) (see also (4)) raised the following problem:

(a) For what integer $N$ is it possible to form triples, out of $N$ given elements,
in such a way that every pair of elements appears in exactly one
triple?

(b) Assuming (a) solved we require the further possibility of forming
quadruples so that any three elements, not already forming a triple, should
appear in exactly one quadruple and that no quadruple should contain a
triple. Does this impose new conditions on the number $N$?

(c) Assuming (a) and (b) solved, can we moreover form quintuples so
that any four elements, neither forming a quadruple nor containing a triple,
should appear in exactly one quintuple and that no quintuple should contain
a quadruple or a triple? Does this impose further restrictions on the number $N$?

Steiner carries on stating analogous problems for sixtuples, septuples, etc.

The problem of Steiner is essentially equivalent to that of $S(m - 1, m, n)$
systems. The special case of $S(2, 3, n)$ constitutes Steiner's famous triple
problem (a), and $S(3, 4, n)$ are equivalent to Steiner's quadruple problem (b)
with $n = N + 1$. It can easily be seen that by adding an additional element
to the system described in (b) and by joining it to every existing triple, a
$S(3, 4, N + 1)$ system evolves.

The present form of the problem is due to Moore (3). 

So far the problem has been solved completely only for $l = 2$, $m = 3$, that
is, for triple systems: from (1) it follows that a necessary condition for the
existence of $S(2, 3, n)$ is

$$(2)\qquad n \equiv 1 \text{ or } 3 \pmod 6;$$

on the other hand, Reiss (5) and later independently Moore (2) proved that
this condition is also sufficient. Other results are limited to special values of
$l$, $m$, $n$. A list of systems $S(l, m, n)$ which are known to exist can be found
in (7).

In the case of $l = 3$, $m = 4$, that is, of quadruple systems, it follows from
(1) that a necessary condition for the existence of $S(3, 4, n)$ is $n \equiv 2$ or $4 \pmod 6$.
The object of the present paper is to prove that this condition is also sufficient.
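The divisibility condition (1) is easy to evaluate numerically. The following sketch (the function name `satisfies_condition_1` is ours, not the paper's) tests condition (1) for small $n$ and lists the values that pass for triple and quadruple systems; it also evaluates the case $S(2, 7, 43)$ mentioned above, where the condition holds although, according to the text, no such system exists.

```python
from math import comb

def satisfies_condition_1(l, m, n):
    """Condition (1): C(n-h, l-h) / C(m-h, l-h) is an integer for h = 0..l-1."""
    return all(comb(n - h, l - h) % comb(m - h, l - h) == 0 for h in range(l))

# Values of n passing the necessary condition for S(2,3,n) and S(3,4,n).
triple_ok = [n for n in range(3, 40) if satisfies_condition_1(2, 3, n)]
quad_ok = [n for n in range(4, 40) if satisfies_condition_1(3, 4, n)]

print(triple_ok)
print(quad_ok)
print(satisfies_condition_1(2, 7, 43))
```

The two lists consist exactly of the $n \equiv 1, 3 \pmod 6$ and $n \equiv 2, 4 \pmod 6$ in the tested ranges.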


2. Definitions and notation. 


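2.1. The systems $P_\alpha(m)$. Given a set of $2m$ elements $0, 1, \ldots, 2m - 1$, we
decompose the $m(2m - 1)$ unordered pairs $[r, s]$ formed from them into
$2m - 1$ systems $P_\alpha(m)$, $(\alpha = 0, 1, \ldots, 2m - 2)$, each containing $m$ mutually
disjoint pairs.³ The elements are to be taken modulo $2m$.

For $m \equiv 0 \pmod 2$ we form the systems $P_\alpha(m)$ as follows:

$$P_{2\beta}(m) = \{[2a,\ 2a + 2\beta + 1] : a = 0, 1, \ldots, m - 1\},$$
$$P_{2\beta+1}(m) = \{[2a,\ 2a - 2\beta - 1] : a = 0, 1, \ldots, m - 1\}, \qquad (\beta = 0, 1, \ldots, \tfrac12(m - 2));$$
$$P_{m+\gamma}(m) = \left\{\begin{array}{l}
[b,\ 2\gamma - b] : b = 0, 1, \ldots, \gamma - 1 \\
{[c,\ 2m + 2\gamma - c - 2]} : c = 2\gamma + 1, 2\gamma + 2, \ldots, m + \gamma - 2 \\
{[\tfrac12(4m - 3 - (-1)^\gamma),\ \gamma]},\ [\tfrac12(4m - 3 + (-1)^\gamma),\ m + \gamma - 1]
\end{array}\right\}, \qquad (\gamma = 0, 1, \ldots, m - 2).$$

For $m \equiv 1 \pmod 2$ we put:

$$P_{2\beta}(m) = \{[2a,\ 2a + 2\beta + 1] : a = 0, 1, \ldots, m - 1\},$$
$$P_{2\beta+1}(m) = \{[2a,\ 2a - 2\beta - 1] : a = 0, 1, \ldots, m - 1\}, \qquad (\beta = 0, 1, \ldots, \tfrac12(m - 3));$$
$$P_{m-1+\gamma}(m) = \left\{\begin{array}{l}
[b,\ 2\gamma - b] : b = 0, 1, \ldots, \gamma - 1 \\
{[c,\ 2m + 2\gamma - c]} : c = 2\gamma + 1, 2\gamma + 2, \ldots, m + \gamma - 1 \\
{[\gamma,\ m + \gamma]}
\end{array}\right\}, \qquad (\gamma = 0, 1, \ldots, m - 1).$$

³Other systems of pairs may be found in (5) and (4), but in the sequel we shall need the
systems $P_\alpha(m)$ as defined here.

It can be easily verified that the pairs in every system are mutually
disjoint and that no pair appears twice. As the number of pairs in the systems
is $m(2m - 1)$ it follows that every pair appears in some system.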
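The claimed properties of the systems $P_\alpha(m)$ can be checked mechanically. The sketch below is a reading of the definitions, not the author's own procedure. It assumes elements are reduced modulo $2m$, that for even $m$ the ranges are $\beta = 0, \ldots, \tfrac12(m-2)$ and $\gamma = 0, \ldots, m-2$ with the two extra pairs $[\tfrac12(4m-3-(-1)^\gamma), \gamma]$ and $[\tfrac12(4m-3+(-1)^\gamma), m+\gamma-1]$, and that for odd $m$ the ranges are $\beta = 0, \ldots, \tfrac12(m-3)$ and $\gamma = 0, \ldots, m-1$. The function name `pair_systems` is illustrative.

```python
def pair_systems(m):
    """Return {alpha: list of pairs} for the 2m-1 systems P_alpha(m) on {0..2m-1}."""
    n = 2 * m

    def pair(r, s):
        return frozenset({r % n, s % n})   # elements are taken modulo 2m

    P = {}
    if m % 2 == 0:
        for beta in range(m // 2):                       # beta = 0..(m-2)/2
            P[2 * beta] = [pair(2 * a, 2 * a + 2 * beta + 1) for a in range(m)]
            P[2 * beta + 1] = [pair(2 * a, 2 * a - 2 * beta - 1) for a in range(m)]
        for g in range(m - 1):                           # gamma = 0..m-2
            system = [pair(b, 2 * g - b) for b in range(g)]
            system += [pair(c, 2 * m + 2 * g - c - 2)
                       for c in range(2 * g + 1, m + g - 1)]
            sign = (-1) ** g
            system.append(pair((4 * m - 3 - sign) // 2, g))
            system.append(pair((4 * m - 3 + sign) // 2, m + g - 1))
            P[m + g] = system
    else:
        for beta in range((m - 1) // 2):                 # beta = 0..(m-3)/2
            P[2 * beta] = [pair(2 * a, 2 * a + 2 * beta + 1) for a in range(m)]
            P[2 * beta + 1] = [pair(2 * a, 2 * a - 2 * beta - 1) for a in range(m)]
        for g in range(m):                               # gamma = 0..m-1
            system = [pair(b, 2 * g - b) for b in range(g)]
            system += [pair(c, 2 * m + 2 * g - c) for c in range(2 * g + 1, m + g)]
            system.append(pair(g, m + g))
            P[m - 1 + g] = system
    return P

for alpha, system in sorted(pair_systems(4).items()):
    print(alpha, sorted(tuple(sorted(pr)) for pr in system))
```

For each $m$ one can then confirm that there are $2m - 1$ systems, each consisting of $m$ mutually disjoint pairs, and that the $m(2m-1)$ pairs are all different.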
2.2. The systems $\bar P_\xi(m)$. In the sequel we shall also need another decomposition
of pairs formed from $2m$ elements, namely into $2m$ systems $\bar P_\xi(m)$,
$(\xi = 0, 1, \ldots, 2m - 1)$ such that each of the $m$ systems $\bar P_\eta(m)$, $(\eta = 0, 1,
\ldots, m - 1)$ should contain $m - 1$ mutually disjoint pairs not containing
the elements $2\eta$ and $2\eta + 1$, and each of the other $m$ systems should contain
$m$ mutually disjoint pairs.

We shall form the systems $\bar P_\xi(m)$ using the systems $P_\alpha(m)$ defined in the
preceding section.

If $m \equiv 0 \pmod 2$ it can easily be seen that

$$[2\mu,\ 4\mu + 1] \in P_{2\mu}(m), \qquad [2m - 2 - 2\mu,\ 2m - 1 - 4\mu] \in P_{2\mu-1}(m), \qquad (\mu = 1, 2, \ldots, \tfrac12(m - 2));$$
$$[2m - 2,\ 0] \in P_m(m); \qquad [2m - 1,\ 1] \in P_{m+1}(m).$$

Clearly, these pairs are mutually disjoint. We remove them from their
respective systems and form from them a new system.
Performing the following permutation of the elements

$$\begin{pmatrix} 2\mu, & 4\mu + 1, & 2m - 2 - 2\mu, & 2m - 1 - 4\mu, & 2m - 2, & 0, & 2m - 1, & 1 \\ 4\mu, & 4\mu + 1, & 4\mu - 2, & 4\mu - 1, & 1, & 0, & 2m - 1, & 2m - 2 \end{pmatrix}, \qquad (\mu = 1, 2, \ldots, \tfrac12(m - 2)),$$

we obtain by a suitable reordering of the systems the new systems $\bar P_\xi(m)$.

In the case $m \equiv 1 \pmod 2$ we have

$$[2\mu,\ 4\mu + 1] \in P_{2\mu}(m), \qquad [2m - 2 - 2\mu,\ 2m - 3 - 4\mu] \in P_{2\mu+1}(m), \qquad (\mu = 0, 1, \ldots, \tfrac12(m - 3));$$
$$[m - 1,\ 2m - 1] \in P_{2m-2}(m).$$

These pairs are again mutually disjoint.
By the permutation

$$\begin{pmatrix} 2\mu, & 4\mu + 1, & 2m - 2 - 2\mu, & 2m - 3 - 4\mu, & m - 1, & 2m - 1 \\ 4\mu, & 4\mu + 1, & 4\mu + 2, & 4\mu + 3, & 2m - 2, & 2m - 1 \end{pmatrix}, \qquad (\mu = 0, 1, \ldots, \tfrac12(m - 3)),$$

of the elements and using the same procedure as in the case $m \equiv 0 \pmod 2$
we obtain the systems $\bar P_\xi(m)$.

2.3. The quadruples. Let there be given a set $F$ of $f$ elements $0, 1, \ldots, f - 1$.

If a system $S(3, 4, f)$ exists we say that it is possible to form a quadruple
system from $F$ and we write $F \in Q$ and also $f \in Q$.

If $F \in Q$, we shall in the sequel denote by $\{x, y, z, t\} \subset F$ any quadruple
in $F$, that is, an element of $S(3, 4, f)$. The number of quadruples will be
denoted by $q(f)$:

$$q(f) = \tfrac{1}{24} f(f - 1)(f - 2).$$

If $f + 1 \in Q$, then $F \cup \{A\} \in Q$ where $A$ is some additional element. The
quadruples which contain $A$ will be denoted by $\{A, u, v, w\}$ and their number
by $p(f)$:

$$p(f) = \tfrac{1}{6} f(f - 1).$$

The other quadruples will be denoted by $\{x, y, z, t\}$ and their number by
$q'(f)$:

$$q'(f) = q(f + 1) - p(f) = \tfrac{1}{24} f(f - 1)(f - 3).$$
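The three counting functions can be written down directly; the short sketch below (function names `q`, `p`, `q_prime` follow the text's symbols, with `q_prime` standing for $q'$) uses exact fractions so that the identity $q'(f) = q(f+1) - p(f) = \tfrac1{24} f(f-1)(f-3)$ can be checked for arbitrary $f$.

```python
from fractions import Fraction

def q(f):
    """Number of quadruples in an S(3,4,f): f(f-1)(f-2)/24."""
    return Fraction(f * (f - 1) * (f - 2), 24)

def p(f):
    """Number of quadruples of an S(3,4,f+1) containing a fixed element A."""
    return Fraction(f * (f - 1), 6)

def q_prime(f):
    """Number of quadruples of an S(3,4,f+1) not containing A."""
    return q(f + 1) - p(f)

print(q(4), q(8), q(14), p(7), q_prime(7))
```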


2.4. The elements. Elements of the sets used in this paper will often be
denoted by a pair of numbers $(i, j)$, $(i = 0, 1, \ldots, g - 1;\ j = 0, 1, \ldots, f - 1)$;
$i$ will then be called the first index and $j$ the second index of the element.

For the sake of uniformity we shall denote sometimes elements by $(A, h)$,
$(h = 0, 1, \ldots, e - 1)$ instead of the more commonly used notation $A_h$.

In the above notation we shall also include elements $(a, b)$, $(A, c)$ with
$a$, $b$, and $c$ not necessarily restricted to $a < g$, $b < f$, and $c < e$. In these
cases the indices are to be taken modulo $g$, $f$, and $e$ respectively.

2.5. Checking of systems. In order to show that some given family of quadruples
formed from the elements of a set $N$ (having $n$ elements) are a system
it must be proved that:

(i) Every subset of $N$ having 3 elements is contained in some quadruple.

(ii) The intersection of every two quadruples has 2 elements at most.

Evidently (i) will imply (ii) if it can be verified that the total number of
the quadruples is $q(n)$ and we shall use this method of checking the systems
in the sequel.


3. THEOREM. A system $S(3, 4, n)$ exists if and only if $n \equiv 2$ or $4 \pmod 6$.

Proof. The necessity has been proved in § 1. The proof of sufficiency will
be given by induction. Evidently $4 \in Q$. We shall show that if $n \equiv 2$ or
$4 \pmod 6$ and if for every $g < n$ satisfying $g \equiv 2$ or $4 \pmod 6$, $g \in Q$ holds,
then also $n \in Q$. The proof will be given separately for each of the following
cases which evidently exhaust all the possibilities:

3.1. $n \equiv 4$ or $8 \pmod{12}$,
3.2. $n \equiv 4$ or $10 \pmod{18}$,
3.3. $n \equiv 34 \pmod{36}$,
3.4. $n \equiv 26 \pmod{36}$,
3.5. $n \equiv 2$ or $10 \pmod{24}$, $(n > 2)$,
3.6. $n \equiv 14$ or $38 \pmod{72}$.


3.1.⁴ $n \equiv 4$ or $8 \pmod{12}$. We put $n = 2f$, where $f \equiv 2$ or $4 \pmod 6$ and
by the assumption of the induction $f \in Q$. Denote $F = \{j : j = 0, 1, \ldots, f - 1\}$,
$N = \{(i, j) : i = 0, 1;\ j = 0, 1, \ldots, f - 1\}$. Further let $\{x, y, z, t\}$ be any
quadruple in $F$; the number of such quadruples is $q(f)$.

Form the following quadruples in $N$:

$L_1 : (a_1, x)(a_2, y)(a_3, z)(a_4, t)$, $a_1 + a_2 + a_3 + a_4 \equiv 0 \pmod 2$;
$L_2 : (0, j)(0, j')(1, j)(1, j')$, $(j = 0, 1, \ldots, f - 1;\ j' = 0, 1, \ldots, f - 1;\ j \ne j')$.

In the quadruples $L_1$ three of the indices $a_1, a_2, a_3, a_4$ can be chosen freely
from the numbers 0 and 1, and accordingly the number of these quadruples
is $8q(f)$. The number of quadruples $L_2$ is evidently $\tfrac12 f(f - 1)$. The total number
of quadruples is therefore $8q(f) + \tfrac12 f(f - 1) = q(n)$.

By 2.5, it remains to be shown that every subset of $N$ containing 3 elements
$\{(a_1, j_1)(a_2, j_2)(a_3, j_3)\}$ is included in some quadruple. This is, however, evident
as:

(a) if $j_1 \ne j_2 \ne j_3 \ne j_1$ it is included in some $L_1$;

(b) otherwise it is included in some $L_2$.

Consequently $n \in Q$.
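The doubling step $f \to 2f$ is simple enough to run. The sketch below is one possible rendering of the families $L_1$ and $L_2$ (the function names `covers_each_triple_once` and `double` are ours); it starts from the single quadruple on four elements and checks the result on eight elements by brute force.

```python
from itertools import combinations, product

def covers_each_triple_once(points, blocks):
    """True if every 3-subset of `points` lies in exactly one 4-element block."""
    seen = {}
    for block in blocks:
        if len(block) != 4:
            return False
        for triple in combinations(block, 3):
            key = frozenset(triple)
            seen[key] = seen.get(key, 0) + 1
    return all(seen.get(frozenset(t), 0) == 1 for t in combinations(points, 3))

def double(f, blocks_f):
    """Families L1 and L2 on N = {(i, j)} built from an S(3,4,f) on {0..f-1}."""
    out = set()
    for block in blocks_f:                        # L1
        x, y, z, t = sorted(block)
        for a1, a2, a3 in product(range(2), repeat=3):
            a4 = (a1 + a2 + a3) % 2               # makes a1+a2+a3+a4 even
            out.add(frozenset({(a1, x), (a2, y), (a3, z), (a4, t)}))
    for j, j2 in combinations(range(f), 2):       # L2
        out.add(frozenset({(0, j), (0, j2), (1, j), (1, j2)}))
    return out

s4 = {frozenset({0, 1, 2, 3})}
s8 = double(4, s4)
points8 = [(i, j) for i in range(2) for j in range(4)]
print(len(s8), covers_each_triple_once(points8, s8))
```

Relabelling $(i, j)$ as $4i + j$ and applying `double` again gives a system on 16 elements in the same way.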

3.2. $n \equiv 4$ or $10 \pmod{18}$. Put $n = 3f + 1$, where $f \equiv 1$ or $3 \pmod 6$.
Thus $f + 1 \in Q$. Denote $F = \{j : j = 0, 1, \ldots, f - 1\}$, $N = \{(A); (i, j) :
i = 0, 1, 2;\ j = 0, 1, \ldots, f - 1\}$. (See 2.3 for definitions of $\{(A), u, v, w\}$ and
$\{x, y, z, t\}$.)

Form the following quadruples in $N$, their number being given after each family:

$L_1 : (a_1, x)(a_2, y)(a_3, z)(a_4, t)$, $a_1 + a_2 + a_3 + a_4 \equiv 0 \pmod 3$; number $27q'(f)$
$L_2 : (A)(b_1, u)(b_2, v)(b_3, w)$, $b_1 + b_2 + b_3 \equiv 0 \pmod 3$; number $9p(f)$
$L_3 : (i, u)(i, v)(i + 1, w)(i + 2, w)$; number $9p(f)$
$L_4 : (i, j)(i, j')(i + 1, j)(i + 1, j')$, $j \ne j'$; number $3 \cdot \tfrac12 f(f - 1)$
$L_5 : (A)(0, j)(1, j)(2, j)$; number $f$

totalling $q(n)$.

⁴This part of the proof is not new. It is well known that from $f \in Q$ follows $2f \in Q$, (see, for
example, (1) and (7)).

Now every subset $T$ of $N$ containing three elements is contained in one of
the quadruples, namely:
(a) if $T = \{(A)(i_1, j_1)(i_2, j_2)\}$ and
    (aa) if $j_1 \ne j_2$, in some $L_2$;
    (ab) if $j_1 = j_2$, in some $L_5$;
(b) if $T = \{(i_1, j_1)(i_2, j_2)(i_3, j_3)\}$ and
    (ba) if $j_1 \ne j_2 \ne j_3 \ne j_1$ and
        (baa) if $j_1, j_2, j_3$ form a $\{u, v, w\}$, in $L_2$ or in $L_3$;
        (bab) otherwise, in $L_1$;
    (bb) if $j_1 = j_2 \ne j_3$ and
        (bba) if $i_1 \ne i_2 \ne i_3 \ne i_1$, in $L_3$;
        (bbb) otherwise, in $L_4$;
    (bc) if $j_1 = j_2 = j_3$, in $L_5$.
It is thus proved that $n \in Q$.
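The construction $f \to 3f + 1$ can also be run directly. The following sketch is our reading of the five families $L_1$ to $L_5$ (first indices taken modulo 3); the names `triple_plus_one`, `covers_each_triple_once` and the use of the string `"A"` for the extra element are illustrative choices. It builds an $S(3, 4, 10)$ from the single quadruple on $\{A, 0, 1, 2\}$ and verifies it by brute force.

```python
from itertools import combinations, product

def covers_each_triple_once(points, blocks):
    """True if every 3-subset of `points` lies in exactly one 4-element block."""
    seen = {}
    for block in blocks:
        if len(block) != 4:
            return False
        for triple in combinations(block, 3):
            key = frozenset(triple)
            seen[key] = seen.get(key, 0) + 1
    return all(seen.get(frozenset(t), 0) == 1 for t in combinations(points, 3))

def triple_plus_one(f, blocks, A="A"):
    """Families L1..L5 on N = {A} + {(i, j)} from an S(3,4,f+1) on {0..f-1} + {A}."""
    out = set()
    for block in blocks:
        if A in block:
            u, v, w = sorted(block - {A})
            for b1, b2 in product(range(3), repeat=2):          # L2
                b3 = (-b1 - b2) % 3
                out.add(frozenset({A, (b1, u), (b2, v), (b3, w)}))
            for i in range(3):                                  # L3
                for s, t, r in ((u, v, w), (u, w, v), (v, w, u)):
                    out.add(frozenset({(i, s), (i, t),
                                       ((i + 1) % 3, r), ((i + 2) % 3, r)}))
        else:
            x, y, z, t = sorted(block)
            for a1, a2, a3 in product(range(3), repeat=3):      # L1
                a4 = (-a1 - a2 - a3) % 3
                out.add(frozenset({(a1, x), (a2, y), (a3, z), (a4, t)}))
    for i in range(3):                                          # L4
        for j, j2 in combinations(range(f), 2):
            out.add(frozenset({(i, j), (i, j2),
                               ((i + 1) % 3, j), ((i + 1) % 3, j2)}))
    for j in range(f):                                          # L5
        out.add(frozenset({A, (0, j), (1, j), (2, j)}))
    return out

s10 = triple_plus_one(3, {frozenset({"A", 0, 1, 2})})
points10 = ["A"] + [(i, j) for i in range(3) for j in range(3)]
print(len(s10), covers_each_triple_once(points10, s10))
```

With $f = 3$ there are no quadruples avoiding $A$, so $L_1$ is empty here; an input system on 8 elements exercises $L_1$ as well.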


3.3. $n \equiv 34 \pmod{36}$. Put $n = 3f + 4$, where $f \equiv 10 \pmod{12}$ and denote
$f = 12k + 10$. Here $f + 4 \in Q$. Denote $F = \{j : j = 0, 1, \ldots, f - 1\}$,
$N = \{(i, j); (A, h) : i = 0, 1, 2;\ j = 0, 1, \ldots, f - 1;\ h = 0, 1, 2, 3\}$. We have
$F \cup \{(A, h) : h = 0, 1, 2, 3\} \in Q$. By $\{x, y, z, t\}$ we denote quadruples in
this set, one of them being $\{(A, h) : h = 0, 1, 2, 3\}$.

Form the following quadruples in $N$, their number being given after each family:

$L_1 : (A, 0)(A, 1)(A, 2)(A, 3)$; number $1$
$L_2 : (i, x)(i, y)(i, z)(i, t)$,⁵ quadruple $L_1$ excluded; number $3(q(f + 4) - 1)$
$L_3 : (A, a_1)(0, a_2)(1, a_3)(2, a_4)$, $a_1 + a_2 + a_3 + a_4 \equiv 0 \pmod f$; number $4f^2$
$L_4 : (i + 2, b_3)(i, b_1 + 2k + 1 + i(4k + 2) - d)(i, b_1 + 2k + 2 + i(4k + 2) + d)(i + 1, b_2)$,
    $b_1 + b_2 + b_3 \equiv 0 \pmod f$, $d = 0, 1, \ldots, 2k$; number $3(2k + 1)f^2$
$L_5 : (i, r_\alpha)(i, s_\alpha)(i + 1, r'_\alpha)(i + 1, s'_\alpha)$,
    $[r_\alpha, s_\alpha]$ and $[r'_\alpha, s'_\alpha]$ are (equal or different) pairs in $P_\alpha(6k + 5)$, (see 2.1),
    $\alpha = 4k + 2, 4k + 3, \ldots, 12k + 8$; number $3(8k + 7)(\tfrac12 f)^2$

totalling $q(n)$.

⁵Whenever for $x$, $y$, $z$, or $t$ appears $(A, h)$, omit the first index $i$.

Again, every subset $T$ of $N$ containing three elements is contained in some
quadruple:

(a) if $T = \{(A, h_1)(A, h_2)(A, h_3)\}$, in $L_1$;
(b) if $T = \{(A, h_1)(A, h_2)(i_3, j_3)\}$, in $L_2$;
(c) if $T = \{(A, h_1)(i_1, j_1)(i_2, j_2)\}$ and
    (ca) if $i_1 = i_2$, in $L_2$;
    (cb) otherwise, in $L_3$;
(d) if $T = \{(i_1, j_1)(i_2, j_2)(i_3, j_3)\}$ and
    (da) if $i_1 \ne i_2 \ne i_3 \ne i_1$ and
        (daa) if $j_1 + j_2 + j_3 \equiv f - 3, f - 2, f - 1$, or $0 \pmod f$, in $L_3$;
        (dab) otherwise, in $L_4$;
    (db) if $i_1 = i_2 \ne i_3$ and
        (dba) if $|j_2 - j_1| \equiv 1 \pmod 2$ and $|j_2 - j_1| \le 4k + 1$, in $L_4$;
        (dbb) otherwise, in $L_5$;
    (dc) if $i_1 = i_2 = i_3$, in $L_2$.

Thus $n \in Q$.

3.4. $n \equiv 26 \pmod{36}$. Here $n = 3f + 2$, $f \equiv 8 \pmod{12}$, or $f = 12k + 8$;
$f + 2 \in Q$. Denote $F = \{j : j = 0, 1, \ldots, f - 1\}$, $N = \{(i, j); (A, h) : i = 0,
1, 2;\ j = 0, 1, \ldots, f - 1;\ h = 0, 1\}$. We have $F \cup \{(A, h) : h = 0, 1\} \in Q$.
By $\{x, y, z, t\}$ denote quadruples in this set.

Form the following quadruples in $N$, their number being given after each family:

$L_2 : (i, x)(i, y)(i, z)(i, t)$;⁶ number $3q(f + 2)$
$L_3 : (A, a_1)(0, a_2)(1, a_3)(2, a_4)$, $a_1 + a_2 + a_3 + a_4 \equiv 0 \pmod f$; number $2f^2$
$L_4 : (i + 2, b_3)(i, b_1 + 2k + 1 + i(4k + 2) - d)(i, b_1 + 2k + 2 + i(4k + 2) + d)(i + 1, b_2)$,
    $b_1 + b_2 + b_3 \equiv 0 \pmod f$, $d = 0, 1, \ldots, 2k$; number $3(2k + 1)f^2$
$L_5 : (i, r_\alpha)(i, s_\alpha)(i + 1, r'_\alpha)(i + 1, s'_\alpha)$,
    $[r_\alpha, s_\alpha]$ and $[r'_\alpha, s'_\alpha]$ are pairs in $P_\alpha(6k + 4)$,
    $\alpha = 4k + 2, 4k + 3, \ldots, 12k + 6$; number $3(8k + 5)(\tfrac12 f)^2$

totalling $q(n)$.

⁶Ibid.

Checking that every subset of $N$ containing 3 elements is contained in
some quadruple is made in the same way as in the preceding section with
the only difference that the case (a) is omitted and (daa) reads: if $j_1 + j_2
+ j_3 \equiv f - 1$, or $0 \pmod f$, and so on. Consequently $n \in Q$.


3.5. $n \equiv 2$ or $10 \pmod{24}$, $n > 2$. Put $n = 4f + 2$, $f \equiv 0$ or $2 \pmod 6$,
$(f > 0)$; $f = 2k$; by the assumption of the induction $f + 2 \in Q$. Denote
$F = \{j : j = 0, 1, \ldots, f - 1\}$, $N = \{(h, i, j); (A, l) : h = 0, 1;\ i = 0, 1;\
j = 0, 1, \ldots, f - 1;\ l = 0, 1\}$. By $\{x, y, z, t\}$ denote quadruples in
$F \cup \{(A, l) : l = 0, 1\} \in Q$.

Form the quadruples in $N$: their number being:
L, : (h, i, x) (A, i, y) (h, 4, 2) (h, 4, t);7 4q(f + 2) 
L: : (A, 1) (0, 0, 2c,) (0, 1, 2c = e) (1, €, 2¢3 a l), f2 
Cy + Co + C3 = 0(mod &), 
e=0Q,1; 
L; : (A, 2) (0, 0, 2¢1 - 1) (0, l, 2c2 = 1 — e)(1, é, 2c3 a 1 —_ l); f2 
L, : (A, D(A, 0, 2e:) (1, 1, 2c2 — €) (0, «, 2c3 + 1 — J); f2 
Ls : (A, )(1, 0, 2c, + 1), 1, 2ce -—-l-—- e) (0, €, 2¢3 + l); f? 
Le : (h, 0, 2c, + €)(h, 1, 2co — €) (kh + 1, 0, F,,) (A + 1,0, 5), (k — 1)f? 
[7.3, 5-3] are pairs in P.,(k), (see 2.2), 
ome Ge Bs «0% k—1; 
L; : (h, 0, 2c, — 1 + ©) (h, 1, 2c. — )(A + 1,1,%,)(44+1,1,83); (k — UP? 
Ls ° (h, 0, 2c; a e) (h, 1, 2c — e) (A — 1, CR Fries) (h a l, i ee | kf? 
Lo : (h, 0, 2c, — 1 + €)(h, 1, 2c2 — €) (hh + 1, 0, Fores) (A + 1,0, Sercs); Rf? 
Lio : (h, 0, tx) (h, 0, Sa) (h, 1, Ta’) (h, 1, Sa’), 2(f — 1)k? 
[ray Sa] and [ra’, Sa] are pairs 
in P.(k), (see 2.1.), 
a=0,1,..., f —2:; 
totalling q(n) 


⁷Ibid.

It will now be checked that every triple $T$ in $N$ is contained in some quadruple.
(a) If $T$ is of the form $\{(A, 0)(A, 1)(h_1, i_1, j_1)\}$, it is contained in some $L_1$;
(b) if $T = \{(A, l)(h_1, i_1, j_1)(h_2, i_2, j_2)\}$ and
    (ba) if $h_1 = h_2$ and
        (baa) if $i_1 = i_2$, in $L_1$;
        (bab) if $i_1 \ne i_2$, say $i_1 = 0$, $i_2 = 1$ and
            (baba) if $j_1 \equiv 0 \pmod 2$, in $L_2$ or $L_4$;
            (babb) if $j_1 \equiv 1 \pmod 2$, in $L_3$ or $L_5$;
    (bb) if $h_1 \ne h_2$, say $h_1 = 0$, $h_2 = 1$ and
        (bba) if $i_1 = i_2 = 0$ and
            (bbaa) if $j_1 + j_2 + l \equiv 0 \pmod 2$, in $L_2$ or $L_3$, $(\epsilon = 0)$;
            (bbab) if $j_1 + j_2 + l \equiv 1 \pmod 2$, in $L_4$ or $L_5$, $(\epsilon = 0)$;
        (bbb) if $i_1 = i_2 = 1$ and
            (bbba) if $j_1 + j_2 + l \equiv 0 \pmod 2$, in $L_4$ or $L_5$, $(\epsilon = 1)$;
            (bbbb) if $j_1 + j_2 + l \equiv 1 \pmod 2$, in $L_2$ or $L_3$, $(\epsilon = 1)$;
        (bbc) if $i_1 = 0$, $i_2 = 1$, and
            (bbca) if $j_1 + j_2 + l \equiv 0 \pmod 2$, in $L_2$ or $L_3$, $(\epsilon = 1)$;
            (bbcb) if $j_1 + j_2 + l \equiv 1 \pmod 2$, in $L_4$ or $L_5$, $(\epsilon = 0)$;
        (bbd) if $i_1 = 1$, $i_2 = 0$ and
            (bbda) if $j_1 + j_2 + l \equiv 0 \pmod 2$, in $L_2$ or $L_3$, $(\epsilon = 0)$;
            (bbdb) if $j_1 + j_2 + l \equiv 1 \pmod 2$, in $L_4$ or $L_5$, $(\epsilon = 1)$;
(c) if $T = \{(h_1, i_1, j_1)(h_2, i_2, j_2)(h_3, i_3, j_3)\}$ and
    (ca) if $h_1 = h_2 = h_3$ and
        (caa) if $i_1 = i_2 = i_3$, in $L_1$;
        (cab) otherwise, in $L_{10}$;
    (cb) if $h_1 = h_2 \ne h_3$ and
        (cba) if $i_1 = i_2 = 0$, in $L_6$ or $L_9$;
        (cbb) if $i_1 = i_2 = 1$, in $L_7$ or $L_8$;
        (cbc) if $i_1 \ne i_2$, say $i_1 = 0$, $i_2 = 1$ and
            (cbca) if $i_3 = 0$ and
                (cbcaa) if $j_1 + j_2 \equiv 1 \pmod 2$, in $L_9$;
                (cbcab) if $j_1 + j_2 \equiv 0 \pmod 2$ and
                    (cbcaba) if $j_1 + j_2 + j_3 \not\equiv 0, 1 \pmod f$, in $L_6$;
                    (cbcabb) if $j_1 + j_2 + j_3 \equiv 0$ or $1 \pmod f$, then
                        if $h_1 = h_2 = 0$, in $L_2$ or $L_3$, $(\epsilon = 0)$;
                        if $h_1 = h_2 = 1$, in $L_4$ or $L_5$, $(\epsilon = 0)$;
            (cbcb) if $i_3 = 1$ and
                (cbcba) if $j_1 + j_2 \equiv 0 \pmod 2$, in $L_8$;
                (cbcbb) if $j_1 + j_2 \equiv 1 \pmod 2$ and
                    (cbcbba) if $j_1 + j_2 + j_3 \not\equiv 0, f - 1 \pmod f$, in $L_7$;
                    (cbcbbb) if $j_1 + j_2 + j_3 \equiv 0$ or $f - 1 \pmod f$, then
                        if $h_1 = h_2 = 0$, in $L_2$ or $L_3$, $(\epsilon = 1)$;
                        if $h_1 = h_2 = 1$, in $L_4$ or $L_5$, $(\epsilon = 1)$.
This proves that $n \in Q$.


3.6. $n \equiv 14$ or $38 \pmod{72}$. Here $n = 12f + 2$, $f \equiv 1$ or $3 \pmod 6$ and
$f + 1 \in Q$. We shall prove that $n \in Q$.

We begin proving that $14 \in Q$. We take as elements the 14 symbols:
0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D;
and form $q(14) = 91$ quadruples as follows:

0125 038D 1236 157C 24AC 3579 479C
013B 039A 1247 1589 24BD 358B 5678
0146 0459 128B 15BD 257B 35AC 569B
0178 047B 129A 1679 258A 367B 56CD
019D 048A 12CD 168D 259C 3689 59AD
01AC 04CD 1345 16BC 267C 36AD 68AC
0234 057D 137D 17AB 269D 3BCD 789A
0268 058C 138A 235D 26AB 457A 78BC
0279 05AB 139C 237A 278D 458D 79BD
02AD 067A 148C 238C 346C 45BC 7ACD
02BC 069C 149B 239B 3478 467D 89CD
0356 06BD 14AD 2456 349D 468B 8ABD
037C 089B 156A 2489 34AB 469A 9ABC

It can be easily checked that every three elements are included in some
quadruple and consequently these quadruples form a $S(3, 4, 14)$.
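The check that the 91 quadruples form an $S(3, 4, 14)$ is tedious by hand but immediate by machine. The sketch below transcribes the table (with the symbols A, B, C, D as characters; the variable names are ours) and counts how often each of the $\binom{14}{3} = 364$ triples is covered.

```python
from itertools import combinations

TABLE = """
0125 038D 1236 157C 24AC 3579 479C
013B 039A 1247 1589 24BD 358B 5678
0146 0459 128B 15BD 257B 35AC 569B
0178 047B 129A 1679 258A 367B 56CD
019D 048A 12CD 168D 259C 3689 59AD
01AC 04CD 1345 16BC 267C 36AD 68AC
0234 057D 137D 17AB 269D 3BCD 789A
0268 058C 138A 235D 26AB 457A 78BC
0279 05AB 139C 237A 278D 458D 79BD
02AD 067A 148C 238C 346C 45BC 7ACD
02BC 069C 149B 239B 3478 467D 89CD
0356 06BD 14AD 2456 349D 468B 8ABD
037C 089B 156A 2489 34AB 469A 9ABC
""".split()

symbols = "0123456789ABCD"
blocks = [frozenset(word) for word in TABLE]

# Count, for every triple of symbols, the quadruples containing it.
coverage = {frozenset(t): 0 for t in combinations(symbols, 3)}
for block in blocks:
    for triple in combinations(sorted(block), 3):
        coverage[frozenset(triple)] += 1

is_system = all(count == 1 for count in coverage.values())
print(len(blocks), is_system)
```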

We now form the set $N' = \{(i, j); (A, h) : i = 0, 1, 2;\ j = 0, 1, \ldots, 11;\
h = 0, 1\}$ having 38 elements and we will show that $N' \in Q$. The system
of quadruples in $N'$ will be constructed so that it will contain all the quadruples
in $\{(i, j); (A, h) : j = 0, 1, \ldots, 11;\ h = 0, 1\}$ for $i = 0, 1, 2$. By $\{x', y', z', t'\}$
denote quadruples in $\{j; (A, h) : j = 0, 1, \ldots, 11;\ h = 0, 1\}$.

Form the following quadruples, their number being given after each family:

$L_1 : (i, x')(i, y')(i, z')(i, t')$;⁸ number 273
$L_2 : (A, h)(0, b_1)(1, b_2)(2, b_3 + 3h)$, $b_1 + b_2 + b_3 \equiv 0 \pmod{12}$, $h = 0, 1$; number 288
$L_3 : (i, b_1 + 4 + i)(i, b_1 + 7 + i)(i + 1, b_2)(i + 2, b_3)$; number 432
$L_4 : (i, j)(i + 1, j + 6\epsilon)(i + 2, 6\epsilon - 2j + 1)(i + 2, 6\epsilon - 2j - 1)$, $\epsilon = 0, 1$; number 72
$L_5 : (i, j)(i + 1, j + 6\epsilon)(i + 2, 6\epsilon - 2j + 2)(i + 2, 6\epsilon - 2j - 2)$; number 72
$L_6 : (i, j)(i + 1, j + 6\epsilon - 3)(i + 2, 6\epsilon - 2j + 1)(i + 2, 6\epsilon - 2j + 2)$; number 72
$L_7 : (i, j)(i + 1, j + 6\epsilon + 3)(i + 2, 6\epsilon - 2j - 1)(i + 2, 6\epsilon - 2j - 2)$; number 72
$L_8 : (i, j)(i, j + 6)(i + 1, j + 3\epsilon)(i + 1, j + 6 + 3\epsilon)$; number 36
$L_9 : (i, 2g + 3\epsilon)(i, 2g + 6 + 3\epsilon)(i', 2g + 1)(i', 2g + 5)$, $i' \ne i$, $g = 0, 1, 2, 3, 4, 5$; number 72
$L_{10} : (i, 2g + 3\epsilon)(i, 2g + 6 + 3\epsilon)(i', 2g + 2)(i', 2g + 4)$; number 72
$L_{11} : (i, j)(i, j + 1)(i + 1, j + 3e)(i + 1, j + 3e + 1)$, $e = 0, 1, 2, 3$; number 144
$L_{12} : (i, j)(i, j + 2)(i + 1, j + 3e)(i + 1, j + 3e + 2)$; number 144
$L_{13} : (i, j)(i, j + 4)(i + 1, j + 3e)(i + 1, j + 3e + 4)$; number 144
$L_{14} : (i, r_\alpha)(i, s_\alpha)(i', r'_\alpha)(i', s'_\alpha)$,
    $[r_\alpha, s_\alpha]$ and $[r'_\alpha, s'_\alpha]$ are pairs in $P_\alpha(6)$, (see 2.1), $\alpha = 4, 5$; number 216

totalling $2109 = q(38)$.

⁸Ibid.


Checking that every triple 7’ in N’ is contained in some quadruple is 
carried out as follows: 


(a) if T’ = {(A,0)(A, 1)(i, f:)}, it is contained in Ly; 


(b) if 7’ = {(A, Ay) (ty, ji) (42, j2)} and 
(ba) if 4; = 4, in Ly; 
(bb) if 4; # te, in Le; 
(c) if 7’ = { (4, jr) (te, je) (ts, js)} and 
(ca) if 4; = te = 43, in L); 
(cb) if % 
Now 
(cba) if j; + je + js # 0 (mod 3) and 
(cbaa) if je — j; = 1 (mod 12), in Lis; 
(cbab) if j2 — j, = 2 (mod 12), in Lia; 
(cbac) if je — j, = 3 (mod 12), in Ls; 
4 
) 


ig * 13, we may assume that 1 < (jz — j:) (mod 12) < 6. 


(chad) if j2 — j; = 4 (mod 12), in Lis; 
(cbae) if je — j: = 5 (mod 12), in Lig; 
(cbaf) if je — j, = 6 (mod 12), in Ly or Lio; 
(cbb) if j; + je + js = 0 (mod 3) and 
(cbba) if j2 — j: = 1 (mod 12) and 
(cbbaa) if 7; = 0 (mod 2), in Ly; 
(cbbab) if j; = 1 (mod 2) in Le; 
(cbbb) if je — 7; = 2 (mod 12) and 
(cbbba) if 7; = 0 (mod 2), in Lio; 
(cbbbb) if 7; = 1 (mod 2), in La; 


~ 











156 HAIM HANANI 


(cbbc) if j2 — j; = 3 (mod 12), in L;; 
(cbbd) if j2 — j, = 4 (mod 12) and 
(cbbda) if 7, = 0 (mod 2), in Ls; 
(cbbdb) if 7; = 1 (mod 2), in Ly; 
(cbbe) if j2 — 7, = 5 (mod 12), in Ly,; 
(cbbf) if j72 — j1 = 6 (mod 12), in Ls; 
(cc) if 4; F te H i3 ¥ 1% and 
(cca) if j: + je + js 


iil 


0 or 3(mod 12), in Le; 
(ccb) if j: + j2 + js = 4, 5, 6, 7, 8, or 9(mod 12), in Ls; 
(ccc) if j1 + jo + js = 1, 2, 10 or 11(mod 12) it is evident that 
two of the second indices, say j; and jz must be j2 — j; = 0 
(mod 3). Now 
(ccca) if je — 7; = 0 (mod 6) and 
(cccaa) if 7) +j2+j;=1 or 11(mod 12), T” is 
contained in Ly; 
(cccab) if j; + je + j3 = 20r 10(mod 12), in Ls; 
(cccb) if j2 — j; = 3 (mod 6) and 
(cecba) if 7; + je + j3 = 1 or 2(mod 12), in Lz; 
(cecbb) if 7; + je + js = 10 or 11(mod 12), in Le. 


Thus $38 \in Q$ is proved.


We are now able to prove the case $n \equiv 14$ or $38 \pmod{72}$ (that is, $f \equiv 1$
or $3 \pmod 6$) generally. We introduce an auxiliary element $B$ and obtain
$F = \{j; B : j = 0, 1, \ldots, f - 1\} \in Q$. (The quadruples $\{B, u, v, w\}$ and
$\{x, y, z, t\}$ in $F$ are defined in § 2.3.) Denote $N = \{(i, j); (A, h) : i = 0, 1, \ldots, f - 1;\ j = 0, 1, \ldots, 11;\ h = 0, 1\}$ and form the quadruples in $N$,
their number being:
M; : (i, x’) (i, y’) (4, 2’) (i, &)3° 91f 


| (A, h)(u, b;) (v, be) (w, 63 + 3h), 
b; + be + b3 = 0(mod 12); 


M:: (ss, c2:) (@, cxs) (en, ex) (eo, axa): (2109-273) . p(f) 
(i, 81) (i, Be) (7’, Bs) (7, Bs) 
| i’ # i. 


$\alpha_\nu, \beta_\nu$ ($\nu = 1, 2, 3, 4$) are to be replaced by the second indices
of $L_3$-$L_{14}$ corresponding to the first indices 0, 1, 2 for $u, v, w$
respectively. It should be noted that $i$ and $i'$ define uniquely
a $\{u, v, w\}$ in which they are contained and therefore they
may be considered as two indices from this $\{u, v, w\}$.

$M_3$ : $(x, a_1)(y, a_2)(z, a_3)(t, a_4)$, $a_1 + a_2 + a_3 + a_4 \equiv 0 \pmod{12}$; $1728 \cdot q'(f)$


totalling q(n) 


*Ibid.





It is easy to see that every triple $T$ in $N$ is contained in some quadruple:
(a) if $T = \{(A, 0)(A, 1)(i, j_1)\}$, it is contained in $M_1$;
(b) if $T = \{(A, h)(i_1, j_1)(i_2, j_2)\}$ and
(ba) if $i_1 = i_2$, it is contained in $M_1$;
(bb) if $i_1 \neq i_2$, in $M_2$;
(c) if $T = \{(i_1, j_1)(i_2, j_2)(i_3, j_3)\}$ and
(ca) if $i_1 = i_2 = i_3$, in $M_1$;
(cb) if $i_1 = i_2 \neq i_3$, in $M_2$;
(cc) if $i_1 \neq i_2 \neq i_3 \neq i_1$, and
(cca) if $i_1, i_2, i_3$ form a $\{u, v, w\}$, in $M_2$;
(ccb) otherwise in $M_3$.


Consequently in this case again $n \in Q$, and the proof is herewith completed.


REFERENCES 


1. R. D. Carmichael, Introduction to the theory of groups of finite order (New York, 1956), 415-441.
2. E. H. Moore, Concerning triple systems, Math. Annal. 43 (1893), 271-285.
3. E. H. Moore, Tactical memoranda, Amer. J. Math. 18 (1896), 264-303.
4. E. Netto, Lehrbuch der Combinatorik, zweite Auflage (Leipzig, 1927), pp. 202-220, 321-329.
5. M. Reiss, Ueber eine Steinersche combinatorische Aufgabe, J. reine und angew. Math., 56 (1859), 326-344.
6. J. Steiner, Combinatorische Aufgabe, J. reine und angew. Math., 45 (1853), 181-182; also, Gesammelte Werke II (Berlin, 1884), pp. 435-436.
7. E. Witt, Ueber Steinersche Systeme, Abh. Math. Sem. Hamburg, 12 (1938), 265-275.
8. G. Tarry, Le problème des 36 officiers, C. R. Assoc. Franc. Av. Sci., 1 (1900), 1 (1901), 170-203.


Technion, Israel Institute of Technology, Haifa 











A METRIZATION FOR POWER-SETS WITH 
APPLICATIONS TO COMBINATORIAL ANALYSIS 


ROBERT SILVERMAN 


1. Introduction. Combinatorial configurations may generally be phrased
in terms of arrangements of objects into sets subject to certain conditions. In
view of this, the question arises as to whether, given a set $S$ and its power-set
$U_S$ (the class of all subsets of $S$), it might be possible to structure $U_S$ in a
combinatorially significant manner. This paper proposes and investigates one
such structuring achieved by defining a distance function over $U_S$.

Given $A, B$ in $U_S$, define their distance by

$$d(A, B) = N([A \cup B] - [A \cap B])/2,$$

where $N(E)$ denotes the number of elements in $E$, $+\infty$ being an admissible
value. One readily verifies that the distance function satisfies the metric
postulates $d(A, B) = 0$ if and only if $A = B$, and $d(A, B) \le d(A, C) + d(B, C)$
for all $A, B, C$ in $U_S$ (15). More generally, we may define a higher dimensional
metric by associating with every $r$-tuple $E_1, \ldots, E_r$ of elements of $U_S$, the
number $d(E_1, \ldots, E_r) = N(\bigcup E_i - \bigcap E_i)/r$. Although it appears that this
will be necessary in order to obtain metric characterizations of, for example,
the theorem of Desargues, only the ordinary metric is studied here.

Given the sets $S_1, \ldots, S_k$, denote by

$$\pi S_i = S_1 \times S_2 \times \cdots \times S_k = \{(s_1, \ldots, s_k);\ s_i \in S_i\},$$

their Cartesian product of ordered $k$-tuples. Since $\pi S_i$ may be viewed as a
subclass of an appropriate power-set by identifying the element $(s_1, \ldots, s_k)$
with the set $\{(1, s_1), \ldots, (k, s_k)\}$, the above definition also yields a metrization
for Cartesian products which may be restated: For $x, y$ in $\pi S_i$,
$x = (x_1, \ldots, x_k)$, $y = (y_1, \ldots, y_k)$, $d(x, y)$ is the number of subscripts $i$ for
which $x_i \neq y_i$, $i = 1, \ldots, k$.
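The agreement between the power-set distance and its restatement for Cartesian products can be illustrated as follows. This is a minimal sketch; the helper names `set_distance`, `as_set` and `tuple_distance` are ours and do not come from the paper.

```python
def set_distance(a: frozenset, b: frozenset) -> float:
    """d(A, B) = N([A u B] - [A n B]) / 2, for finite sets."""
    return len(a ^ b) / 2

def as_set(x: tuple) -> frozenset:
    """Identify (s_1, ..., s_k) with the set {(1, s_1), ..., (k, s_k)}."""
    return frozenset(enumerate(x, start=1))

def tuple_distance(x: tuple, y: tuple) -> int:
    """Number of subscripts i for which x_i != y_i."""
    return sum(1 for xi, yi in zip(x, y) if xi != yi)

x = ("a", "b", "c", "d")
y = ("a", "x", "c", "y")
print(set_distance(as_set(x), as_set(y)))  # 2.0
print(tuple_distance(x, y))                # 2
```

The division by 2 is what makes the two descriptions coincide: each differing coordinate contributes two elements to the symmetric difference.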

Section 3 gives metric characterizations of some of the classical configurations
and their generalizations, such as balanced incomplete block designs
(and, in particular, $v, k, \lambda$ configurations and projective planes) and orthogonal
Latin squares and cubes. Section 4 sets forth some theorems for metrized
Cartesian product spaces.


2. Definitions and notation. In order to reduce to a minimum the
introduction of new terminology, wherever feasible the author has adopted
that used by Blumenthal (4; 5), whose excellent books also nourished several
interesting trains of thought. In the following, $M$ denotes an abstract metric
space, and $E$ a subspace of $M$. Wherever applicable, $+\infty$ is regarded as an
admissible value.

Received January 12, 1959. This work was sponsored in part by the Office of Ordnance
Research. The paper is based largely on the author's Ph.D. dissertation submitted to the Ohio
State University in 1958.


DEFINITION 1. If $M = \{a_1, a_2, \ldots\}$ is countable, it may be completely
specified by the symmetric distance matrix

$$A = [a_{ij}], \quad a_{ij} = d(a_i, a_j).$$


DEFINITION 2. For $a$ in $M$, $r > 0$, the open sphere and closed sphere with
centre $a$, radius $r$ are defined, respectively, by

$$s(a, r) = \{x;\ x \text{ in } M,\ d(x, a) < r\},$$
$$c(a, r) = \{x;\ x \text{ in } M,\ d(x, a) \le r\}.$$

Note that in general a sphere need not have a unique centre or radius.


DEFINITION 3. For x in M, the distance of x from E is given by d(x, E) = g.l.b. 
d(x, y) for y in E. 


DEFINITION 4. Two metric spaces M and M’ are isometric provided there 
exists a mapping a from M onto M’ such that d(x, y) = d(a(x), a(y)) for all 
x,y in M. We write M ~ M’. Note that a is biunique since a(x) = a(y) 
implies d(a(x),a(y)) = d(x, y) = 0. If M = M’, the isometry is termed a 
motion. Two subsets of M are superposable provided a motion exists that maps 
one onto the other. 


DEFINITION 5. E is a metric basis of M provided each point of M is uniquely 
determined by its distances from the points of E. 


DEFINITION 6. The major diameter $\Delta(E)$, of $E$, and the minor diameter
$\delta(E)$, of $E$ are defined by

$$\Delta(E) = \text{l.u.b. } d(x, y) \text{ for } x, y \text{ in } E,$$
$$\delta(E) = \text{g.l.b. } d(x, y) \text{ for } x, y \text{ in } E,\ x \neq y.$$

If $E$ contains fewer than two points, define $\delta(E) = 0$.

Combinatorial configurations generally are highly symmetric in various
aspects of their structure. Searching for a means of obtaining some sort of
"symmetrizing" condition in $U_S$, it was discovered that one way of achieving
this is to require that $U_S$ contain a "large" number of elements mutually
"far apart." These considerations motivate the next definition.

DEFINITION 7. The $t$-extent of $E$, $e(E, t)$, is the greatest integer $m$ such that
$E$ contains $m$ distinct points with minor diameter greater than $t$. If no two
points of $E$ have distance greater than $t$, set $e(E, t) = 1$, while if for $m$ arbitrarily
large there are $m$ points of $E$ with minor diameter exceeding $t$, define
$e(E, t) = +\infty$.
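For a small finite space the $t$-extent can be computed by exhaustive search. The following is only a brute-force sketch under our own assumptions: points are tuples compared by the number of differing coordinates, the input is finite and non-empty, and the names `hamming` and `t_extent` are illustrative.

```python
from itertools import combinations, product

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def t_extent(points, t, dist=hamming):
    """Largest m such that some m distinct points have all pairwise
    distances greater than t (brute force; finite inputs only)."""
    pts = list(dict.fromkeys(points))   # distinct points, order kept
    best = 1                            # convention e(E, t) = 1
    for m in range(2, len(pts) + 1):
        spread_out = any(
            all(dist(p, q) > t for p, q in combinations(subset, 2))
            for subset in combinations(pts, m)
        )
        if not spread_out:
            break                       # no larger subset can work either
        best = m
    return best

E = list(product((0, 1), repeat=3))     # all binary triples
print(t_extent(E, 1), t_extent(E, 2), t_extent(E, 3))  # 4 2 1
```

The early exit is justified because any subset of a set with minor diameter greater than $t$ has the same property.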








As we shall see in the next section, the concept of $t$-extent enables us to
give simple metric characterizations of the various configurations examined
there.

Let $S(n)$ denote a set of $n$ elements ($n \ge 1$), and $S^k(n)$ ($k \ge 1$) the $k$-fold
Cartesian product of $S(n)$,

$$S^k(n) = \{(x_1, \ldots, x_k);\ x_i \in S(n)\}.$$

Assume that $S^k(n)$ has been metrized as in the preceding section, so that for
$x, y$ in $S^k(n)$, $x = (x_1, \ldots, x_k)$, $y = (y_1, \ldots, y_k)$, $d(x, y)$ is the number of
subscripts $i$ for which $x_i \neq y_i$, $i = 1, \ldots, k$. For $0 \le r \le k$, every set of
$n^r + 1$ elements of $S^k(n)$ has minor diameter at most $k - r$, since two of them must
agree in their first $r$ components. Hence the $(k - r)$-extent
of every subspace $E$ of $S^k(n)$ satisfies $e(E, k - r) \le n^r$. We next define
terms to describe subspaces which attain this maximum extent.


DEFINITION 8. A subspace $E$ of $S^k(n)$ is $r$-orthogonal, $0 \le r \le k$, provided
$e(E, k - r) = n^r$. If in addition $E$ contains precisely $n^r$ elements, $E$ is termed
an $L(n, k, r)$ space. (Thus $E$ is $r$-orthogonal if and only if $E$ contains an
$L(n, k, r)$ space.)

For a given subspace $E$, $r$-orthogonality does not imply $(r - 1)$-orthogonality.
$S^k(n)$ always has orthogonality 0, 1, and $k$. Indeed, any point constitutes
an $L(n, k, 0)$ space; the points $(i, i, \ldots, i)$, $i = 1, \ldots, n$ comprise
an $L(n, k, 1)$ space, and $S^k(n)$ is itself an $L(n, k, k)$ space. For values between
1 and $k$ the property becomes non-trivial, and, as we shall see in the following
section, is related to some of the classical unsolved problems in combinatorial
analysis.
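The three trivial cases just mentioned can be confirmed mechanically. The sketch below assumes symbols $1, \ldots, n$ and uses a hypothetical checker `is_L_space` that tests the defining property directly: $n^r$ distinct $k$-tuples with all mutual distances exceeding $k - r$.

```python
from itertools import combinations, product

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def is_L_space(points, n, k, r):
    """True if `points` consists of n**r distinct k-tuples over at most
    n symbols whose pairwise distances all exceed k - r."""
    pts = set(points)
    symbols = {s for p in pts for s in p}
    return (
        len(pts) == n ** r
        and all(len(p) == k for p in pts)
        and len(symbols) <= n
        and all(hamming(p, q) > k - r for p, q in combinations(pts, 2))
    )

n, k = 3, 4
S = range(1, n + 1)
print(is_L_space([(1, 1, 1, 1)], n, k, 0))          # True
print(is_L_space([(i,) * k for i in S], n, k, 1))   # True
print(is_L_space(product(S, repeat=k), n, k, k))    # True
print(is_L_space(product(S, repeat=k), n, k, 2))    # False
```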


3. Metric characterizations of some combinatorial configurations.

(a) Latin squares and cubes. A Latin square of order $n$, $A = [a_{ij}]$, is an
$n \times n$ matrix whose entries are from a set of $n$ distinct symbols and such that
each symbol appears exactly once in each row and column. Thus a Latin square
of order $n$ is essentially the multiplication table of a loop of order $n$. Two
Latin squares $A = [a_{ij}]$, $B = [b_{ij}]$ of order $n$ are Graeco-Latin provided the
$n^2$ ordered pairs $(a_{ij}, b_{ij})$ are all distinct. A set of Latin squares of order $n$,
$A_1, A_2, \ldots, A_m$, is orthogonal provided $A_i$ and $A_j$ are Graeco-Latin for
all $i \neq j$. In this event, one readily shows that $m \le n - 1$. An orthogonal set
is complete provided $m = n - 1$.

A Latin cube of order $n$, $A = [a_{ijk}]$, is a cubical array of $n^3$ cells (in $n$
row-planes, $n$ column-planes, and $n$ layers) whose entries are from a set of
$n$ distinct symbols and such that whenever $a_{rst} = a_{uvw}$ and at least two of
the equalities $r = u$, $s = v$, $t = w$ hold, then the third also holds. Note that
this condition holds if and only if each row-plane, column-plane, and layer
is a Latin square of order $n$. Two Latin cubes of order $n$ are Graeco-Latin
provided every pair of corresponding row-planes, column-planes, and layers
is a Graeco-Latin square. Three Latin cubes, $A = [a_{ijk}]$, $B = [b_{ijk}]$, $C = [c_{ijk}]$,
of order $n$ are strongly Eulerian provided each pair is Graeco-Latin and the
$n^3$ ordered triples $(a_{ijk}, b_{ijk}, c_{ijk})$ are all distinct. (These conditions are
stronger than those of Ball (2).) A set of pairwise Graeco-Latin cubes of order
$n$, $A_1, \ldots, A_m$, is orthogonal provided $A_i, A_j, A_k$ are strongly Eulerian for all
$i, j, k$ pairwise distinct. Again one readily shows that $m \le n - 1$, and an
orthogonal set is termed complete provided $m = n - 1$.

There exists a fairly extensive literature on Latin squares. (In this connection,
see the fine historical review by Norton (19).) Euler conjectured that
for $n = 4k + 2$, Graeco-Latin squares of order $n$ do not exist, and Tarry
(25) verified this for $n = 6$. Aside from $n = 6$ ($n = 2$ is, in a sense, vacuous
since a complete set consists of a single square), the question of the validity
of the conjecture has resisted all determined onslaught (although Mann
(17) has ruled out certain candidates, among these being the group multiplication
tables). The case $n = 10$ remains the first undecided instance.¹
MacNeish (16) seems to have been the first to establish the existence of
complete sets of orthogonal Latin squares of prime power order. The interest
in orthogonal Latin squares and finite projective planes was mutually enhanced
when Bose (6) and Levi (14) independently showed the equivalence
of complete sets of such squares to the planes.


Given a set of orthogonal Latin squares $A_1, \ldots, A_m$, of order $n$, construct
the associated $n^2$ $k$-tuples ($k = m + 2$) in the usual manner. (Here the
$k$-tuple $(i_1, \ldots, i_k)$ is admitted if and only if $i_{j+2}$ is in row $i_1$, column $i_2$
of $A_j$, $j = 1, \ldots, k - 2$.) One readily verifies that these $n^2$ elements actually
comprise an $L(n, k, 2)$ space, since any pair of the elements having at least
two corresponding components equal would violate either the Latin condition
on rows or columns, or the orthogonality condition. Thus the minor
diameter of the $n^2$ elements exceeds $k - 2$. Conversely, given an $L(n, k, 2)$
space, we may reverse the process and obtain $k - 2$ orthogonal Latin squares
of side $n$. The same procedure may be employed to show the equivalence of
$L(n, k, 3)$ spaces and sets of $k - 3$ orthogonal Latin cubes.
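The passage from orthogonal squares to an $L(n, k, 2)$ space can be traced on a small case. The sketch assumes the two squares of order 3 given by $i + j$ and $i + 2j$ (mod 3), with rows, columns and symbols numbered from 0; these squares are our choice of example, not taken from the paper.

```python
from itertools import combinations

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

n = 3
# Assumed example squares: cyclic Latin squares over the integers mod 3.
A1 = [[(i + j) % n for j in range(n)] for i in range(n)]
A2 = [[(i + 2 * j) % n for j in range(n)] for i in range(n)]

# Graeco-Latin condition: the n^2 ordered pairs are all distinct.
pairs = {(A1[i][j], A2[i][j]) for i in range(n) for j in range(n)}
assert len(pairs) == n * n

# Associated k-tuples (row, column, entry of A1, entry of A2), k = 4.
k = 4
tuples = [(i, j, A1[i][j], A2[i][j]) for i in range(n) for j in range(n)]
min_dist = min(hamming(p, q) for p, q in combinations(tuples, 2))
print(len(tuples), min_dist)  # 9 3
```

The nine 4-tuples have minor diameter $3 > k - 2$, so they form an $L(3, 4, 2)$ space.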

(b) Finite nets. For $k, n$ positive integers with $k \ge 3$, Bruck (9) defines a
(finite) net $N$ of degree $k$, order $n$, as "a system of undefined objects called
'points' and 'lines' together with an incidence relationship ('point is on line'
or 'line passes through point') such that:

(i) $N$ contains $k$ (non-empty) classes of lines.

(ii) Two lines $a$, $b$ of $N$ belonging to distinct classes, have a unique common
point $P$.

(iii) Each point $P$ of $N$ is on exactly one line of each class.

(iv) Some line of $N$ has exactly $n$ distinct points.

"A finite affine plane with $n$ points on each line, $n \ge 2$, is simply a net
of degree $n + 1$, order $n$ (13). A loop of order $n$ is essentially a net of degree
3, order $n$ (1; 3). More generally, for $3 \le k \le n + 1$, a set of $k - 2$ mutually
orthogonal $n \times n$ Latin squares may be used to define a net of degree $k$,
order $n$ (and conversely) by paralleling Bose's correspondence (6) between
affine planes and complete sets of orthogonal Latin squares."

¹See the addendum for recent developments.

For $k > r > 1$ we may generalize Bruck's configuration to a finite net of
degree $k$, order $n$, and dimension $r$ by replacing (ii) and (iv) with

(ii') Every $r$ lines $l_1, \ldots, l_r$ of $N$ belonging to pairwise distinct classes
have a unique common point $P$.

(iv') Some line of $N$ has exactly $n^{r-1}$ points.

As immediate consequences of the axioms we have

(1) Every class contains $n$ lines.

(2) Every line has exactly $n^{r-1}$ points.

(3) $N$ contains $n^r$ points.

For let $A_1, \ldots, A_r, A_{r+1}, \ldots, A_k$ be the $k$ classes of lines of $N$, and suppose
$A_i$ contains $m_i$ lines, and $N$ contains $m$ points. One readily shows $m_i$ and $m$
to be finite. Then from (ii') and (iii) we obtain the system of $r + 1$ equations
$\prod_{j \neq t} m_j = m$ ($j = 1, \ldots, r + 1$, $j \neq t$; $t = 1, \ldots, r + 1$) which have the unique
solution $m_i = m^{1/r}$, $i = 1, \ldots, r + 1$, and since we may replace $A_{r+1}$ by any
other $A_i$, we obtain $m_i = m^{1/r}$, $i = 1, \ldots, k$. If we next consider any fixed
line of $A_1$ together with the classes $A_2, \ldots, A_r$, then (ii') and (iii) imply
that the line passes through $m^{(r-1)/r}$ points, and this together with (iv')
implies $m = n^r$.

Now let us co-ordinatize $N$ by assigning to the point $P$ the co-ordinates
$(i_1, \ldots, i_k)$ provided $P$ is on the $i_j$th line of the $j$th class. Then (ii') and (iii)
imply that there is a 1-1 correspondence between points and co-ordinates,
and that as elements of $S^k(n)$ any two of these ordered $k$-tuples have distance
exceeding $k - r$. Since there are $n^r$ distinct such $k$-tuples, and each component
of a $k$-tuple can assume $n$ values, the $n^r$ $k$-tuples comprise an $L(n, k, r)$ space.
Conversely, one may reverse the above process, and we thus have a correspondence
between $L(n, k, r)$ spaces with $1 < r < k$ and finite nets of degree
$k$, order $n$, dimension $r$. In particular then, an $L(n, 3, 2)$ space is essentially
a loop of order $n$, an $L(n, n + 1, 2)$ space defines a finite affine plane with
$n$ points on each line ($n \ge 2$), and from an $L(n, k, 2)$ space we may construct
a system of $k - 2$ orthogonal Latin squares of side $n$.

(c) Hypercubes and orthogonal arrays. Rao (21) defines a hypercube of
strength $d$ as follows: "Let there be $m$ factors $A_1, A_2, \ldots, A_m$ each of which
can assume $s$ different values. We define an ordered set $(i_1, i_2, \ldots, i_m)$ as a
combination of $m$ factors obtained by the selection of $i_1$th, $i_2$th ... values
of the first, second, ..., factors respectively. There are $s^m$ such combinations
of which a subset of $s^t$ combinations may be called a $(m, s, t)$ array. An
$(m, s, t)$ array is said to be of strength $d$ if all combinations of any $d$ of the
$m$ factors occur in equal number ($s^{t-d}$) of times. An array of strength $d$ represented
by $(m, s, t, d)$ is, alternatively, called a hypercube of strength $d$." For
$t = d$, these hypercubes correspond to $L(n, k, r)$ spaces with $n = s$, $r = d$,
and conversely.

Bose and Bush (7, 8) weakened the condition of $s^t$ combinations to $N = \lambda s^d$
combinations, to obtain an orthogonal array of strength $d$, size $N$, index $\lambda$,
$k$ constraints, and $s$ levels which they define as "a $k \times N$ matrix $A$, with
entries from a set $\Sigma$ of $s \ge 2$ elements ... [such that] each $d \times N$ submatrix
of $A$ contains all possible $d \times 1$ column vectors with the same frequency
$\lambda$." For $\lambda = 1$ it is clear that the column vectors of $A$ comprise an $L(n, k, r)$
space, and conversely.
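The quoted definition of strength translates directly into a check. In the sketch below the array is held as a list of its $N$ column vectors; the function name `has_strength` and the example array (built from the squares $i + j$ and $i + 2j$ mod 3) are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations

def has_strength(columns, s, d, lam=1):
    """columns: the N column vectors (k-tuples) of a k x N array over s
    symbols. True if every choice of d rows shows each of the s**d
    possible patterns exactly `lam` times."""
    k = len(columns[0])
    for rows in combinations(range(k), d):
        counts = Counter(tuple(col[i] for i in rows) for col in columns)
        if len(counts) != s ** d or set(counts.values()) != {lam}:
            return False
    return True

# Assumed example: 9 columns with k = 4 constraints and s = 3 levels.
cols = [(i, j, (i + j) % 3, (i + 2 * j) % 3)
        for i in range(3) for j in range(3)]
print(has_strength(cols, s=3, d=2))  # True
print(has_strength(cols, s=3, d=3))  # False
```

With index 1 and strength 2, any two of the nine columns agree in at most one coordinate, which is the $L(3, 4, 2)$ condition.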

(d) A configuration. Let $v$ elements be arranged into $v + 1$ sets $T_1, \ldots, T_{v+1}$
such that for $i \neq j$, the number of elements which are in either $T_i$ or
$T_j$ but not in both is $k$. We may co-ordinatize the sets of the configuration by
assigning to a set the co-ordinates $(i_1, \ldots, i_v)$, where $i_j = 1$ if the $j$th element
is in the set, and $i_j = 0$ otherwise. If $x_1, \ldots, x_{v+1}$ are the co-ordinates of
$T_1, \ldots, T_{v+1}$, respectively, then the $x_i$ comprise a subspace of $S^v(2)$, $S(2) = \{0, 1\}$,
satisfying $d(x_i, x_j) = k$, for all $i \neq j$. We will discuss this configuration
further in the next example.
As an illustration, for $k = 4$, $v = 7$, consider

$T_1 = \{1, 2\}$    $T_4 = \{1, 3, 6, 7\}$    $T_7 = \{2, 4, 6, 7\}$
$T_2 = \{3, 4\}$    $T_5 = \{1, 4, 5, 7\}$    $T_8 = \{1, 2, 3, 4, 5, 6\}$.
$T_3 = \{5, 6\}$    $T_6 = \{2, 3, 5, 7\}$
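The illustration can be verified by co-ordinatizing the eight sets and comparing all pairs. This is a small check under the assumption that the sets are exactly those listed; the helper names are ours.

```python
from itertools import combinations

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

v, k = 7, 4
T = [{1, 2}, {3, 4}, {5, 6}, {1, 3, 6, 7}, {1, 4, 5, 7},
     {2, 3, 5, 7}, {2, 4, 6, 7}, {1, 2, 3, 4, 5, 6}]

def coords(t):
    """Co-ordinates (i_1, ..., i_v): i_j = 1 iff element j is in the set."""
    return tuple(1 if e in t else 0 for e in range(1, v + 1))

x = [coords(t) for t in T]
dists = {hamming(p, q) for p, q in combinations(x, 2)}
print(len(x), dists)  # 8 {4}
```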


(e) The $v, k, \lambda$ configuration. Consider next the now classic $v, k, \lambda$ configuration
defined in Chowla and Ryser (11) as an arrangement of $v$ elements into
$v$ sets such that every set contains exactly $k$ distinct elements and such that
every pair of sets has exactly $\lambda$ elements in common, $0 < \lambda < k < v$. In
statistics these configurations are termed symmetrical balanced incomplete
block designs. For $\lambda = 1$ and $k = n + 1$, $n \ge 2$, the configuration reduces
to a projective plane with $n + 1$ points per line, and for $v = 4m - 1$,
$k = 2m - 1$, $\lambda = m - 1$, it is equivalent to a Hadamard matrix of order
$N = 4m$ (20) (these are the $\pm 1$ matrices $H$ satisfying $HH^T = NI$, where $H$
is of order $N$ and $I$ is the identity matrix). For a comprehensive summary of
results see Ryser (22). With the $v, k, \lambda$ configuration we may associate its
characterizing $v \times v$ incidence matrix $A = [a_{ij}]$, where $a_{ij} = 1$ if the $j$th
element is in the $i$th set, and 0 otherwise. Actually, constructing the incidence
matrix is equivalent to co-ordinatizing the sets of the configuration, the $i$th
row representing the co-ordinates of the $i$th set. It is apparent that the $v$
sets of co-ordinates so obtained comprise $v$ elements of $S^v(2)$, $S(2) = \{0, 1\}$,
satisfying:

(i) If $s = (0, 0, \ldots, 0)$, and the $v$ elements are $x_1, \ldots, x_v$, then
$d(x_i, s) = k$ for $i = 1, \ldots, v$.

(ii) $d(x_i, x_j) = 2(k - \lambda)$, $i \neq j$.
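Conditions (i) and (ii) can be observed on the smallest projective plane. The sketch assumes the familiar 7, 3, 1 configuration on the elements 1 to 7 with the block list below (one common labelling, chosen by us).

```python
from itertools import combinations

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

v, k, lam = 7, 3, 1
# Assumed labelling of the seven lines of the plane of order 2.
blocks = [{1, 2, 3}, {1, 4, 5}, {1, 6, 7}, {2, 4, 6},
          {2, 5, 7}, {3, 4, 7}, {3, 5, 6}]

# Rows of the incidence matrix = co-ordinates of the sets.
rows = [tuple(1 if e in blk else 0 for e in range(1, v + 1)) for blk in blocks]
s = (0,) * v

to_origin = {hamming(x, s) for x in rows}
mutual = {hamming(x, y) for x, y in combinations(rows, 2)}
print(to_origin, mutual)  # {3} {4}
```

Here $d(x_i, s) = k = 3$ and $d(x_i, x_j) = 2(k - \lambda) = 4$.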

That the metric characterization of $v, k, \lambda$ tends toward the heart of the
matter is suggested in (ii) by the fact that the value $k - \lambda$, which appears
in both the $v, k, \lambda$ design and its complementary design (the design obtained
by replacing each set by its complement) and plays such a critical role in
the non-existence theorems, occurs explicitly.

Also of interest is the fact that the "strong" converse of the above holds.
That is, given $v + 1$ elements $s', x_1', \ldots, x_v'$ of $S^v(2)$ satisfying

(i') $d(x_i', s') = k$ for $i = 1, \ldots, v$,

(ii') $d(x_i', x_j') = 2(k - \lambda)$, $i \neq j$, $0 < \lambda < k < v$,

we may construct $v$ elements $x_1, \ldots, x_v$ satisfying (i) and (ii) and hence
constituting a $v, k, \lambda$ configuration. This may be seen as follows. If in the
$j$th components of $x_1', \ldots, x_v', s'$ we replace 0's by 1's and 1's by 0's, we clearly
obtain an isometric space. Now perform this replacement in the $j$th components
if and only if the $j$th component of $s'$ is 1. Then we obtain an isometric space
with $s = (0, 0, \ldots, 0)$ as the image of $s'$.
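The replacement step is simply a coordinatewise complement governed by $s'$. A minimal sketch, with the function name `recentre` and the sample vector chosen by us:

```python
from itertools import product

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def recentre(x, s):
    """Interchange 0 and 1 in exactly those components of x in which
    the corresponding component of s is 1."""
    return tuple(xi ^ si for xi, si in zip(x, s))

s_prime = (1, 0, 1, 1)                       # arbitrary example centre
space = list(product((0, 1), repeat=4))

# s' is carried to the origin, and every distance is preserved.
assert recentre(s_prime, s_prime) == (0, 0, 0, 0)
assert all(
    hamming(recentre(x, s_prime), recentre(y, s_prime)) == hamming(x, y)
    for x in space for y in space
)
```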

Consider again the configuration in (d). Though, at least on the surface,
the relation of this configuration to the $v, k, \lambda$ configuration is somewhat
obscure, by relating both to their metric characterizations, it is immediately
apparent that for $0 < 2\lambda = k < v$, they are essentially equivalent.

(f) Balanced incomplete block designs. Let $T = \{a_1, \ldots, a_v\}$, and consider
the configuration $C = \{T_1, \ldots, T_b\}$, where the $T_i$ are subsets of $T$. Then
the dual configuration consists of the subsets $A_1, \ldots, A_v$ of the set $A = \{t_1, \ldots, t_b\}$,
where $t_j$ is in $A_i$ if and only if $a_i$ is in $T_j$; and the complementary configuration
consists of the sets $\bar T_1, \ldots, \bar T_b$ where $\bar T_i$ denotes the complement of $T_i$.

Given the set $S(b)$ of $b$ elements, let $E_r$ denote the class of all subsets of
$S(b)$ containing $r$ elements. Then $E_r$ is a subspace of the metric space
$U_{S(b)}$. We will show that for $0 < \lambda < r < b$, the existence of a balanced
incomplete block design BIBD (26) with parameters $b, v, k, r, \lambda$ is equivalent
to having $e(E_r, r - \lambda - 1) = (r - \lambda)b/(r^2 - \lambda b)$ with $r^2 - \lambda b > 0$.


THEOREM 3.1. If $r^2 - \lambda b > 0$ ($\lambda$ integral and $0 \le \lambda < r$), then

(a) $e(E_r, r - \lambda - 1) \le v$, where $v = (r - \lambda)b/(r^2 - \lambda b)$.

(b) Equality holds in (a) if and only if there exist $v$ elements $x_1, \ldots, x_v$ in
$E_r$ such that $d(x_i, x_j) = r - \lambda$, for all $i \neq j$.


Proof. Let $e(E_r, r - \lambda - 1) = m$. Then since $\lambda$ is integral there exist
$x_1, \ldots, x_m$ in $E_r$ with $d(x_i, x_j) \ge r - \lambda$ for all $i \neq j$. Denote by $k_i$ the number
of sets $x_j$ containing the $i$th element of $S(b)$, $i = 1, \ldots, b$. Comparing
total occurrences we obtain

$$\sum_{i=1}^{b} k_i = rm. \tag{1}$$

Comparing contributions to all set intersections, we obtain

$$\sum_{i=1}^{b} \binom{k_i}{2} = \sum_{i<j} N(x_i \cap x_j) = \sum_{i<j} [r - d(x_i, x_j)]. \tag{2}$$

But then $d(x_i, x_j) \ge r - \lambda$ implies

$$\sum_{i=1}^{b} \binom{k_i}{2} \le \lambda \binom{m}{2}. \tag{3}$$

Now from Lemma 1 (§ 4) and (1), (3) follows

$$L \equiv b \binom{rm/b}{2} \le M \equiv \sum_{i=1}^{b} \binom{k_i}{2} \le R \equiv \lambda \binom{m}{2}. \tag{4}$$

Using $r^2 - \lambda b > 0$, we find from elementary calculations that $L \le R$ is
equivalent to $m \le v$, and $L = R$ if and only if $m = v$. Thus conclusion (a)
is established. Finally, if $m = v$, from $L = M = R$ and (2) it follows that

$$\sum_{i<j} [r - d(x_i, x_j)] = \lambda \binom{m}{2},$$

and thus $d(x_i, x_j) \ge r - \lambda$ implies $r - d(x_i, x_j) = \lambda$. Hence $d(x_i, x_j) = r - \lambda$,
completing the proof of the theorem.
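The counting in the proof can be followed numerically in a case of equality. The sketch below assumes the seven blocks of the 7, 3, 1 configuration as the points $x_j$ of $E_r$ (so $b = 7$, $r = 3$, $\lambda = 1$), and evaluates the sum of the $k_i$ together with the three quantities here named `L`, `M`, `R` that the proof compares; the example and the helper names are ours.

```python
from fractions import Fraction
from itertools import combinations

def binom2(a):
    """a(a - 1)/2, allowing non-integral a."""
    a = Fraction(a)
    return a * (a - 1) / 2

# Assumed example: seven 3-element subsets of S(7).
blocks = [{1, 2, 3}, {1, 4, 5}, {1, 6, 7}, {2, 4, 6},
          {2, 5, 7}, {3, 4, 7}, {3, 5, 6}]
b, r, lam = 7, 3, 1
m = len(blocks)

def d(x, y):
    """Power-set distance: half the size of the symmetric difference."""
    return Fraction(len(x ^ y), 2)

assert all(d(x, y) >= r - lam for x, y in combinations(blocks, 2))

k = [sum(i in x for x in blocks) for i in range(1, b + 1)]  # occurrences
v = Fraction((r - lam) * b, r * r - lam * b)
L = b * binom2(Fraction(r * m, b))
M = sum(binom2(ki) for ki in k)
R = lam * binom2(m)
print(sum(k) == r * m, v, L, M, R)  # True 7 21 21 21
```

Since $L = M = R$ here, $m = v = 7$ and every pair of sets is at distance exactly $r - \lambda = 2$.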


COROLLARY 1. For $0 < \lambda < r < b$, the configuration $x_1, \ldots, x_v$ is the dual
of a BIBD with parameters $b, v, k, r, \lambda$, where $k = rv/b$. Conversely, given
the BIBD and considering its dual configuration as a subspace of $E_r$, one
obtains $e(E_r, r - \lambda - 1) = v$, $r^2 - \lambda b > 0$, and $v = (r - \lambda)b/(r^2 - \lambda b)$.

Proof. In the proof of the above theorem, from $m = v$ we obtain $L = M$.
This together with Lemma 1 (§ 4) and (1), gives $k_i = rv/b = k$, for $i = 1, \ldots, b$.
Thus every element of $S(b)$ occurs in $k$ of the sets $x_1, \ldots, x_v$. Further, note
that $d(x_i, x_j) = r - \lambda$ for all $i \neq j$ implies that every pair of distinct sets
intersect in exactly $\lambda$ elements. Thus the first conclusion follows. Conversely,
given the BIBD and considering its dual configuration as a subspace of $E_r$,
one obtains $e(E_r, r - \lambda - 1) \ge v$. Further, from $0 < \lambda < r < b$, and the
well-known conditions $rv = bk$, $\lambda(v - 1) = r(k - 1)$, it follows that $r^2 - \lambda b > 0$
and $v = (r - \lambda)b/(r^2 - \lambda b)$. But then from conclusion (a) of the theorem,
we have $e(E_r, r - \lambda - 1) = v$.

Specializing to $v, k, \lambda$ configurations, one obtains the interesting result:

COROLLARY 2. If $v$ elements are arranged in $v' \ge v$ sets of $k$ elements each,
such that every pair of distinct sets has at most $\lambda$ elements in common, where
$\lambda < k^2/v$ is a non-negative integer, then $\lambda \ge k(k - 1)/(v - 1)$. If equality holds,
then $v' = v$ and every pair of distinct sets has exactly $\lambda$ elements in common.
Thus for $1 < k < v$, the arrangement constitutes a $v, k, \lambda$ configuration. (In
this event, it is interesting to note that from Corollary 1 one also obtains
directly that each element occurs in exactly $k$ sets.)

Proof. From conclusion (a) of the theorem we have that if $k^2 - \lambda v > 0$,
then $v \le v' \le e(E_k, k - \lambda - 1) \le (k - \lambda)v/(k^2 - \lambda v)$. But $v \le (k - \lambda)v/(k^2 - \lambda v)$
is equivalent to $\lambda \ge k(k - 1)/(v - 1)$, and equality holds in both
expressions or neither. Finally, apply the proof of (b).








4. Some theorems for metrized Cartesian product spaces. Let $E_i$ denote
a subspace of the metric space $S^k(n_i)$, $i = 1, \ldots, t$. For $x_i = (a_{i1}, \ldots, a_{ik})$ in
$E_i$, $i = 1, \ldots, t$, let $x_1 \otimes x_2 \otimes \cdots \otimes x_t = \dot\pi x_i = (b_1, \ldots, b_k)$, where $b_j = (a_{1j}, \ldots, a_{tj})$,
$j = 1, \ldots, k$. If $\dot\pi x_i = (b_1, \ldots, b_k)$, $\dot\pi y_i = (c_1, \ldots, c_k)$, $x_i$ and $y_i$ in $E_i$,
define $d(\dot\pi x_i, \dot\pi y_i)$ in the usual manner as the number of subscripts $j$ for which
$b_j \neq c_j$, $j = 1, \ldots, k$.

DEFINITION 9. The metric space

$$E_1 \otimes E_2 \otimes \cdots \otimes E_t = \dot\pi E_i = \{\dot\pi x_i;\ x_i \in E_i\}$$

is termed the direct product of the $E_i$. Note the distinction between the direct
product $\dot\pi E_i$ and the Cartesian product $\pi E_i$.

From the above definition, it is clear that any biunique mapping from
$\pi S(n_i)$ onto $S(\pi n_i)$ induces an isometry between $\dot\pi S^k(n_i)$ and $S^k(\pi n_i)$. For
convenience we simply write $\dot\pi S^k(n_i) = S^k(\pi n_i)$. With this understanding,
$\dot\pi E_i$ is a subspace of $S^k(\pi n_i)$. Note that the direct product is independent of
the order of the factors. That is, if $i(1), \ldots, i(t)$ is a permutation of $1, 2, \ldots, t$,
then $\dot\pi E_i$ is isometric to $\dot\pi E_{i(j)}$ under the mapping $\dot\pi x_i \to \dot\pi x_{i(j)}$. Next note
that if $t_1 < t_2 < \cdots < t_s = t$, then $\dot\pi E_{i(1)} \otimes \dot\pi E_{i(2)} \otimes \cdots \otimes \dot\pi E_{i(s)}$ is isometric to
$\dot\pi E_i$, where $i(j) = t_{j-1} + 1, \ldots, t_j$ ($t_0 = 0$). In particular it follows that
$(E_1 \otimes E_2) \otimes E_3$ and $E_1 \otimes (E_2 \otimes E_3)$ are isometric since each is isometric to $E_1 \otimes E_2 \otimes E_3$.
Thus, relative to isometry, the direct product operation is associative and
commutative.

THEOREM 4.1. For $x_i$ and $y_i$ in $E_i$,

$$\max[d(x_i, y_i)] \le d(\dot\pi x_i, \dot\pi y_i) \le \sum d(x_i, y_i).$$

Proof. From the definition it is clear that if $x_i$ and $y_i$ have distinct $j$th
components, then so do $\dot\pi x_i$ and $\dot\pi y_i$. Hence $d(x_i, y_i) \le d(\dot\pi x_i, \dot\pi y_i)$ and the
first inequality follows. Next suppose $\dot\pi x_i$ and $\dot\pi y_i$ have distinct $j$th components.
Then so do at least one pair $x_i, y_i$, and from this it is clear that
the second inequality must hold.
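Both inequalities of Theorem 4.1 can be confirmed exhaustively for two small factors. The sketch assumes $t = 2$, $k = 3$, with the full spaces over 2 and 3 symbols as $E_1$ and $E_2$; the helper `direct` forms the element whose $j$th component is the pair of $j$th components.

```python
from itertools import product

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def direct(*xs):
    """Direct product element: j-th component is (a_1j, ..., a_tj)."""
    return tuple(zip(*xs))

E1 = list(product(range(2), repeat=3))   # assumed example factor
E2 = list(product(range(3), repeat=3))   # assumed example factor

checked = 0
for x1, y1 in product(E1, repeat=2):
    for x2, y2 in product(E2, repeat=2):
        d1, d2 = hamming(x1, y1), hamming(x2, y2)
        d = hamming(direct(x1, x2), direct(y1, y2))
        assert max(d1, d2) <= d <= d1 + d2
        checked += 1
print(checked)  # 46656
```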


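COROLLARY 1. The major and minor diameters $\Delta$, $\delta$ satisfy

$$\max(\delta(E_i)) \le \delta(\dot\pi E_i) \le \Delta(\dot\pi E_i) \le \sum \Delta(E_i).$$

Proof. From the definitions and the theorem we have

$$\delta(E_i) \le d(x_i, y_i) \le \max d(x_i, y_i) \le d(\dot\pi x_i, \dot\pi y_i),$$

which implies $\max \delta(E_i) \le \delta(\dot\pi E_i)$. Also

$$d(\dot\pi x_i, \dot\pi y_i) \le \sum d(x_i, y_i) \le \sum \Delta(E_i)$$

implies that $\Delta(\dot\pi E_i) \le \sum \Delta(E_i)$.

COROLLARY 2. If $S^k(n_1)$ and $S^k(n_2)$ are $r$-orthogonal, then so is $S^k(n_1 n_2)$.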











Proof. Let $L_i$ be an $L(n_i, k, r)$ space of $S^k(n_i)$, $i = 1, 2$. Then $L = L_1 \otimes L_2$
is an $L(n_1 n_2, k, r)$ space of $S^k(n_1 n_2)$, since

$$\delta(L) \ge \max[\delta(L_1), \delta(L_2)] > k - r,$$

and $L$ contains exactly $(n_1 n_2)^r$ elements.
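The construction in this proof can be carried out for $n_1 = 2$, $n_2 = 3$, $k = 3$, $r = 2$. The sketch assumes the $L(n, 3, 2)$ spaces given by the triples $(i, j, i + j \bmod n)$ and checks only the two properties needed for an $L(6, 3, 2)$ space: the number of elements and that all mutual distances exceed $k - r$.

```python
from itertools import combinations

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def direct(x, y):
    """Direct product element: j-th component is the pair (x_j, y_j)."""
    return tuple(zip(x, y))

def latin_space(n):
    """Assumed example of an L(n, 3, 2) space: (row, column, entry)
    triples of the cyclic Latin square of order n."""
    return [(i, j, (i + j) % n) for i in range(n) for j in range(n)]

k, r = 3, 2
L1, L2 = latin_space(2), latin_space(3)
L = [direct(x, y) for x in L1 for y in L2]

symbols = {c for w in L for c in w}          # the 6 symbols of S(2 * 3)
min_dist = min(hamming(u, w) for u, w in combinations(L, 2))
print(len(L), len(symbols), min_dist)  # 36 6 2
```

Thirty-six triples over six symbols with minor diameter $2 > k - r$ amount to a Latin square of order 6.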


By identifying the element $(a_1, \ldots, a_k)$ with the set $\{(1, a_1), \ldots, (k, a_k)\}$,
we can apply Theorem 3.1 and its first corollary to the metric space $S^k(n)$. We
then obtain immediately

THEOREM 4.2. If $k > (r - 1)n$ ($r$ integral, $2 \le r$, $n \ge 1$), then

(a) $e(S^k(n), k - r) \le v$, where $v = n[k - (r - 1)]/[k - n(r - 1)]$.

(b) If equality holds in (a) there exist $v$ elements $x_1, \ldots, x_v$ in $S^k(n)$ such
that $d(x_i, x_j) = k - (r - 1)$, for all $i \neq j$. In this event, each element of $S(n)$
occurs as a $j$th component of exactly $t = [k - (r - 1)]/[k - n(r - 1)]$ of the
$x_i$'s, $j = 1, \ldots, k$. Further, from $x_1, \ldots, x_v$ we can construct a BIBD with
parameters $b', v', k', r', \lambda'$ ($0 < \lambda' < r' < b'$), where $b' = kn$, $v' = v$, $k' = t$,
$r' = k$, $\lambda' = r - 1$.

This BIBD has the special property that its $b'$ blocks can be partitioned
into $k$ pairwise disjoint classes of $n$ blocks each, such that every variety
occurs in exactly one block from each class. Conversely, given such a BIBD,
one can construct $v$ elements of $S^k(n)$ with mutual distances exceeding
$k - r$, and the elementary conditions on its parameters will imply $k > (r - 1)n$.
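A small instance in which the bound of Theorem 4.2 is attained is $n = 2$, $k = 3$, $r = 2$. The sketch below evaluates $v$ and $t$ from the stated formulas and checks them against the four binary triples of even weight; that choice of example and the function name `extent_bound` are ours.

```python
from fractions import Fraction
from itertools import combinations

def hamming(x, y):
    """Number of positions in which the tuples x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def extent_bound(n, k, r):
    """v = n[k - (r - 1)] / [k - n(r - 1)], for k > (r - 1) n."""
    if k <= (r - 1) * n:
        raise ValueError("requires k > (r - 1) n")
    return Fraction(n * (k - (r - 1)), k - n * (r - 1))

n, k, r = 2, 3, 2
v = extent_bound(n, k, r)
t = Fraction(k - (r - 1), k - n * (r - 1))

# Assumed example attaining the bound: even-weight binary triples.
words = [(0, 0, 0), (0, 1, 1), (1, 0, 1), (1, 1, 0)]
dists = {hamming(x, y) for x, y in combinations(words, 2)}
counts = {sum(w[j] == a for w in words) for j in range(k) for a in range(n)}
print(v, len(words), dists, t, counts)  # 4 4 {2} 2 {2}
```

All mutual distances equal $k - (r - 1) = 2$, and each symbol occurs $t = 2$ times in each component, as conclusion (b) requires.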


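From Cauchy's inequality we obtain

LEMMA 1. For the real numbers $a_1, \ldots, a_n$,

$$n \binom{\bar a}{2} \le \sum_{i=1}^{n} \binom{a_i}{2},$$

where

$$\binom{a_i}{2} = a_i(a_i - 1)/2$$

and $\bar a = (\sum a_i)/n$. Further, equality holds if and only if $a_i = \bar a$ for all $i$.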


LEMMA 2. Let $i(1), \ldots, i(t)$, $t \le r$, be any $t$ distinct integers from among
$1, \ldots, k$, and let $a_{i(j)}$ be in $S(n)$. Then in the $L(n, k, r)$ space, $L$, there are
precisely $n^{r-t}$ elements with $i(j)$th component equal to $a_{i(j)}$, $j = 1, \ldots, t$.

Proof. The proof is evident from the fact that $L$ contains $n^r$ elements,
any two distinct elements agree in at most $r - 1$ corresponding components,
and over $S(n)$ every $t$-tuple can be completed to an $r$-tuple in exactly $n^{r-t}$
ways.

LEMMA 3. If $S^k(n)$ is $r$-orthogonal, then $S^{k-1}(n)$ is $(r - 1)$-orthogonal.














Proof. Let $L$ be an $L(n, k, r)$ space of $S^k(n)$. By Lemma 2 there are $n^{r-1}$
elements in $L$ having the same $k$th component. If their $k$th components are
dropped, it is easy to see that the $n^{r-1}$ elements so obtained constitute an
$L(n, k - 1, r - 1)$ space.

LEMMA 4. If $S^k(n)$ is $r$-orthogonal, then so is $S^t(n)$, $t < k$.

Proof. This is clear from the fact that if the last $k - t$ components of the
elements in an $L(n, k, r)$ space are dropped, the resulting elements comprise
an $L(n, t, r)$ space.

LEMMA 5. If $S^k(n)$ is $r$-orthogonal, $r \ge 2$, then $k \le n + r - 1$.

Proof. Let $L$ be an $L(n, k, r)$ space in $S^k(n)$. For $a$ in $S(n)$, by Lemma 2
there are $n$ elements $x_1, \ldots, x_n$ in $L$ with first $r - 1$ components equal to $a$,
and an element $x_{n+1}$ distinct from these and having its first $r - 2$ components
equal to $a$. Thus from $d(x_i, x_j) > k - r$ for all $i \neq j$, it follows that
$n \ge k - r + 1$.

LEMMA 6. For $x$ and $y$ in the $L(n, n + r - 1, r)$ space, $L$, $d(x, y) < n + 1$
implies $d(x, y) = n$.


Proof. Let $x = (a_1, \ldots, a_k)$, $y = (b_1, \ldots, b_k)$, $k = n + r - 1$. Suppose
$d(x, y) < n + 1$. Then $a_i = b_i$ for at least $r - 2$ subscripts $i$. With no loss
of generality, suppose $a_i = b_i$, $i = 1, \ldots, r - 2$. Now by Lemma 2 there
are exactly $n^2 - 1$ elements in $L(n, k, r)$ which are different from $x$ and have
$i$th component equal to $a_i$, $i = 1, \ldots, r - 2$. Let $A_j$ denote the subset
of these $n^2 - 1$ elements having $j$th component equal to $a_j$, $j = r - 1, \ldots, k$.
Then again by Lemma 2, $A_j$ contains precisely $n - 1$ elements. Also, $A_i \cap A_j = \emptyset$
for $i \neq j$, and so $\bigcup A_j$ contains precisely $(n - 1)(n + 1) = n^2 - 1$
elements. Thus every element in $L$ having $i$th component equal to $a_i$, $i = 1, \ldots, r - 2$,
has its $j$th component equal to $a_j$ for precisely one value of $j$, $r - 1 \le j \le k$,
and so has distance $n$ from $x$. In particular, $d(x, y) = n$.


LEMMA 7. Given a $k \times r$ matrix, $A$, over a field $F$, having all its $r$-rowed minors
non-singular, we can construct a $k \times (k - r)$ matrix with all $(k - r)$-rowed
minors non-singular.

Proof. Let $A_1$ denote the $r \times r$ matrix consisting of the first $r$ rows of $A$,
and let $A_2$ denote the $(k - r) \times r$ matrix consisting of the remaining $k - r$
rows, so that

$$A = \begin{bmatrix} A_1 \\ A_2 \end{bmatrix}.$$

By hypothesis $A_1$ is non-singular, so by elementary operations on the columns
of $A$ we can obtain

$$A' = \begin{bmatrix} I \\ A_2' \end{bmatrix},$$

where $I$ is the $r \times r$ identity matrix. Since the elementary operations have
been performed only on the columns of $A$, $A'$ also has all $r$-rowed minors non-singular.
Now using the Laplace expansion, one sees that every minor determinant
of $A_2'$ occurs as a factor of some $r$-rowed minor determinant of $A'$,
and hence is not zero. Again applying the Laplace expansion, one verifies
that every $(k - r)$-rowed minor determinant of the $(k - r) \times k$ matrix
$[A_2', J]$ either equals unity or has the same absolute value as some minor
determinant of $A_2'$ (here $J$ is the $(k - r)$-rowed identity matrix). Hence
$[A_2', J]$ has all $(k - r)$-rowed minors non-singular. Taking the transpose, we
have the required result.


THEOREM 4.3. Given a k X r matrix A = |a,,| over GF(p™) having all r-rowed 
minors non-singular, we can construct an L(n,k, r) and an L(n,k, k — r) space, 
n = p”. 

Proof. Denote the row vectors of $A$ by $\alpha_1, \ldots, \alpha_k$. Let $L = \{Ax\}$, where $x$
ranges over the $n^r$ $r$-place column vectors over $GF(p^m)$. Then $L$ is an $L(n, k, r)$
space, for suppose $Ax$, $Ay$, $x \ne y$, have as many as $r$ components the same,
say $i(1), \ldots, i(r)$. Then the submatrix $B$ of $A$ consisting of the row vectors
$\alpha_{i(1)}, \ldots, \alpha_{i(r)}$ satisfies $Bx = By$. But by hypothesis $B$ is non-singular,
so $x = y$, contradicting our choice of $x$ and $y$. Hence $d(Ax, Ay) > k - r$, and $L$ is an
$L(n, k, r)$ space. Finally, by Lemma 7, from $A$ we can construct a $k \times (k - r)$
matrix with all $(k - r)$-rowed minors non-singular, and applying the above proof
we obtain an $L(n, k, k - r)$ space.
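
As an illustration of the first part of the proof, the set $L = \{Ax\}$ can be enumerated directly for a small case. This is only a sketch under the assumption that the field is GF(p) with p prime; the helper names and the 4 x 2 matrix over GF(3) are hypothetical choices made for the example.

```python
from itertools import product


def build_space(A, p):
    """Return L = {Ax : x in GF(p)^r} as a list of k-tuples (p prime)."""
    r = len(A[0])
    return [tuple(sum(a * x for a, x in zip(row, vec)) % p for row in A)
            for vec in product(range(p), repeat=r)]


def hamming(u, v):
    """Number of components in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))


def min_distance(L):
    """Smallest distance between two distinct elements of L."""
    return min(hamming(u, v) for i, u in enumerate(L) for v in L[i + 1:])


# Hypothetical example with k = 4, r = 2 over GF(3); every 2-rowed minor is non-singular.
A = [[1, 0], [0, 1], [1, 1], [1, 2]]
L = build_space(A, 3)
print(len(L), min_distance(L))  # 9 elements, minimum distance k - r + 1 = 3
```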

The first part of the above theorem corresponds to that of Bose and Bush 
(7, Theorem 5A, p. 521) with index one. They employ a similar proof. 


LEMMA 8. For $n = p^m$, $p$ a prime,

(1) $S^k(n)$ is $r$-orthogonal for $k \le n + 1$.

(2) $S^k(n)$ is 3-orthogonal for $k \le n + 2$ and $p = 2$.

(3) $S^k(n)$ is $r$-orthogonal for $k \le r + 1$.


Proof. Letting $a_1, \ldots, a_{n-1}$ denote the non-zero elements of $GF(p^m)$, one
readily verifies that the matrix

$$A = \begin{bmatrix} B \\ 1\ 0\ \cdots\ 0\ 0 \\ 0\ 0\ \cdots\ 0\ 1 \end{bmatrix},$$

where $B = [b_{ij}]$ is an $(n - 1) \times r$ matrix with $b_{ij} = a_i^{\,j}$, $j = 0, \ldots, r - 1$,
has all $r$-rowed minors non-singular, since their determinants all reduce to
the Vandermonde type. For the special case $p = 2$, $r = 3$, we may adjoin
to $A$ as an $(n + 2)$th row the vector $(0, 1, 0)$ and again verify that $A$ has
all 3-rowed minors non-singular. Conclusions (1) and (2) now follow from
Theorem 4.3 and Lemma 4. Finally, consider (3). For $k = r$, the result is
trivial. For $k = r + 1$, the matrix obtained by adjoining the row vector
$(1, 1, \ldots, 1)$ to the $r \times r$ identity matrix has all $r$-rowed minors non-singular,
and (3) follows from Theorem 4.3.
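
The claim about the minors of this matrix can be checked by brute force in small cases. The sketch below assumes $m = 1$, so that $GF(p)$ is the integers modulo a prime $p$; the function names are hypothetical. It also shows that adjoining the row $(0, 1, 0)$ works for $p = 2$ but fails for $p = 3$.

```python
from itertools import combinations


def det_mod_p(M, p):
    """Determinant of a square matrix modulo a prime p (Gaussian elimination)."""
    M = [[x % p for x in row] for row in M]
    n, det = len(M), 1
    for c in range(n):
        pivot = next((i for i in range(c, n) if M[i][c]), None)
        if pivot is None:
            return 0
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            det = -det
        det = det * M[c][c] % p
        inv = pow(M[c][c], -1, p)
        for i in range(c + 1, n):
            f = M[i][c] * inv % p
            M[i] = [(a - f * b) % p for a, b in zip(M[i], M[c])]
    return det % p


def lemma8_matrix(p, r):
    """The (p + 1) x r matrix: rows (1, a, ..., a^(r-1)) for a != 0, then two unit rows."""
    B = [[pow(a, j, p) for j in range(r)] for a in range(1, p)]
    return B + [[1] + [0] * (r - 1), [0] * (r - 1) + [1]]


def all_minors_nonsingular(A, p):
    """True if every r-rowed minor of the k x r matrix A is non-zero modulo p."""
    r = len(A[0])
    return all(det_mod_p([A[i] for i in rows], p) != 0
               for rows in combinations(range(len(A)), r))


print(all_minors_nonsingular(lemma8_matrix(5, 3), 5))                # k = n + 1 = 6
print(all_minors_nonsingular(lemma8_matrix(2, 3) + [[0, 1, 0]], 2))  # p = 2, k = n + 2
print(all_minors_nonsingular(lemma8_matrix(3, 3) + [[0, 1, 0]], 3))  # extra row fails for p = 3
```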

Conclusions (1) and (2) of the above lemma correspond to the theorem
of Bush (8, p. 431), who obtains the construction by employing polynomials
over $GF(p^m)$.


From Lemmas 4 and 8 and Corollary 2 of Theorem 4.1, we obtain 


THEOREM 4.4. If $n = \prod p_i^{e_i}$ is the decomposition of $n$ into distinct prime
powers, then

(1) $S^k(n)$ is $r$-orthogonal for $k \le \min(p_i^{e_i} + 1)$.

(2) $S^k(n)$ is 3-orthogonal for $k \le 2^m + 2$ if $\min(p_i^{e_i} + 1) = 2^m + 1$.

(3) $S^k(n)$ is $r$-orthogonal for arbitrary $n$ whenever $k \le r + 1$.

From the relation between orthogonal Latin squares and $L(n, k, 2)$ spaces,
for $r = 2$ we obtain the theorem proved in Mann's book (18, Theorem 8.8,
p. 105) (other construction methods may be found in (16, Theorem 12.1)):


COROLLARY. There exist at least $\min(p_i^{e_i} - 1)$ orthogonal Latin squares of
side $n = \prod p_i^{e_i}$.


(It is of interest to note here that in a yet unpublished paper, E. T. Parker
has, by an elegant construction, succeeded in exceeding the minimum given
in the above corollary for certain values of $n$. The author believes this to be
the first such successful attempt.)
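
The lower bound of the corollary to Theorem 4.4 is easy to evaluate. The following short sketch (function names hypothetical) factors $n$ by trial division and returns the smallest prime-power factor less one. As the remark on Parker's work indicates, this figure is only a guaranteed minimum, not the true maximum.

```python
def prime_power_factors(n):
    """Return the prime powers p^e in the factorization of n (n >= 2)."""
    factors, p = [], 2
    while p * p <= n:
        if n % p == 0:
            q = 1
            while n % p == 0:
                n //= p
                q *= p
            factors.append(q)
        p += 1
    if n > 1:
        factors.append(n)
    return factors


def guaranteed_orthogonal_squares(n):
    """Lower bound min(p_i^e_i) - 1 on the number of orthogonal Latin squares of side n."""
    return min(prime_power_factors(n)) - 1


for n in (6, 9, 12, 15):
    print(n, prime_power_factors(n), guaranteed_orthogonal_squares(n))
```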


THEOREM 4.5. Let $L$ be an $L(n, k, r)$ space, $r \ge n - 1$, $k \ge r + 2$. Then
$r = n - 1$, and for every $x$ in $S^k(n)$, $d(x, L) < k - r$.


Proof. Let $x = (x_1, \ldots, x_k)$, and applying Lemma 2 with $t = r$, let
$y = (a_1, \ldots, a_k)$ be the unique element of $L$ having $a_i = x_i$, $i = 1, \ldots, r$.
Let $y' = (a_1, \ldots, a_r)$, and in $S^r(n)$ consider the unit closed sphere $c(y', 1)$.
The sphere contains $r(n - 1)$ elements $y_1', \ldots, y_{r(n-1)}'$ different from $y'$, and
by the triangle inequality, $d(y_i', y_j') \le 2$, $i \ne j$. Again by Lemma 2, to each
$y_i' = (b_{i1}, \ldots, b_{ir})$ there corresponds a unique element $y_i$ in $L$,
$y_i = (b_{i1}, \ldots, b_{ir}, u_i, v_i, \ldots)$. Now $d(y, y_i) > k - r$ and $d(y', y_i') = 1$ imply
$u_i \ne a_{r+1}$, $v_i \ne a_{r+2}$. Also, for $i \ne j$, $d(y_i, y_j) > k - r$ and $d(y_i', y_j') \le 2$
imply that the ordered pairs $(u_i, v_i)$ and $(u_j, v_j)$ are distinct. Thus there are
$r(n - 1)$ distinct ordered pairs $(u_i, v_i)$ for which $u_i \ne a_{r+1}$ and $v_i \ne a_{r+2}$. But
since the total number of such pairs is $(n - 1)^2$, we must have
$r(n - 1) \le (n - 1)^2$ or $r \le n - 1$. Hence $r = n - 1$.

We now prove the second part of the theorem. If $x_i = a_i$ for either
$i = r + 1$ or $r + 2$, we are done. So assume $x_i \ne a_i$, $i = r + 1, r + 2$. But
then $(x_{r+1}, x_{r+2})$ must be one of the pairs $(u_i, v_i)$, and the conclusion follows.

As an immediate corollary, we obtain the equivalent of the theorem of
Bush (8, p. 427):


COROLLARY 1. If $S^k(n)$ is $r$-orthogonal, then $r \ge n$ implies $k \le r + 1$.

(Thus there are no Graeco-Latin squares of order 2, no Graeco-Latin cubes
of order 3, etc.)


Lemma 5 and the above corollary give 


COROLLARY 2. For $1 < r < k - 1$ and $k \ge 2n - 1$, $S^k(n)$ is not $r$-orthogonal.


THEOREM 4.6. If $S^k(n)$ is $r$-orthogonal for $r \ge 2$ and $k = n + r - 1$, then

$$\binom{n + t - 3}{t - 2} \equiv 0 \pmod{t - 1} \quad \text{for } t = 2, 3, \ldots, r.$$


Proof. The proof is by induction on $r$. For $r = 2$ the theorem is trivial.
Assume the theorem holds for $r - 1$, $r \ge 3$. Let $L$ be an $L(n, k, r)$ space, and
let $x = (a_1, \ldots, a_k)$ be in $L$, $k = n + r - 1$. By Lemma 2, $L$ contains $n^{r-2}$
elements $y_i$, $i = 1, \ldots, n^{r-2}$, with first and second components $b_1$ and $b_2$,
respectively, $b_1 \ne a_1$, $b_2 \ne a_2$. Also, for every set $i_1, \ldots, i_{r-2}$ of $r - 2$ distinct
integers from among $3, 4, \ldots, n + r - 1$, there is a unique element
$y(i_1, \ldots, i_{r-2})$ among the $y_i$'s with $i_j$th component equal to $a_{i(j)}$
($i(j) = i_j$), $j = 1, \ldots, r - 2$. But then by Lemma 6, the distance of this element
from $x$ is $n$, and so among the last $n + r - 3$ components, $y(i_1, \ldots, i_{r-2})$ has
$r - 1$ components equal to the corresponding components of $x$. Hence there are

$$\binom{r - 1}{r - 2} = r - 1$$

distinct sets $\{j_1, \ldots, j_{r-2}\}$ associated with the same element $y(i_1, \ldots, i_{r-2})$.
Further, since $x$ and $y(i_1, \ldots, i_{r-2})$ agree in at most $r - 1$ corresponding
components, there can be no more than $r - 1$ such sets associated with
$y(i_1, \ldots, i_{r-2})$. Thus the

$$\binom{n + r - 3}{r - 2}$$

sets are divided into classes of $r - 1$ sets each, and we must have

$$\binom{n + r - 3}{r - 2} \equiv 0 \pmod{r - 1}.$$

Applying the induction hypothesis and Lemma 3, we obtain the theorem.

From $t = 3$ in the above, one obtains the theorem of Bush (8, p. 430):


COROLLARY. For $n$ odd, $r \ge 3$, an $L(n, k, r)$ space satisfies $k \le n + r - 2$.
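
The divisibility condition of Theorem 4.6 is straightforward to evaluate. The sketch below (function name hypothetical) tests it for given $n$ and $r$. Note that the condition is only necessary: a `True` result does not by itself show that an $L(n, n + r - 1, r)$ space exists.

```python
from math import comb


def passes_divisibility_test(n, r):
    """Necessary condition of Theorem 4.6 for an L(n, n + r - 1, r) space to exist."""
    return all(comb(n + t - 3, t - 2) % (t - 1) == 0 for t in range(2, r + 1))


print(passes_divisibility_test(4, 3))  # n even: the case t = 3 requires 2 | n
print(passes_divisibility_test(5, 3))  # n odd: fails at t = 3, as in the corollary
print(passes_divisibility_test(4, 4))  # fails at t = 4, since C(5, 2) = 10 is not divisible by 3
```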


Relative to our previous remarks (§ 3, Example (a)), from the above theorem
and Lemma 8 it follows that complete sets of orthogonal Latin cubes always
exist for $n$ a power of 2, and never exist for $n$ odd. However, for $n$ an odd
prime power $\ge 5$, we can always construct a complete set less one.


THEOREM 4.7. If $S^n(n)$ is 2-orthogonal, then so is $S^{n+1}(n)$.


Proof. Let $L$ be an $L(n, n, 2)$ space in $S^n(n)$, where $S(n) = \{a_1, \ldots, a_n\}$.
Denote by $x_1, \ldots, x_n$ the $n$ elements of $L$ having first component equal to
$a_1$ (Lemma 2). Let $x_i'$ in $S^{n+1}(n)$ be the element obtained by adjoining $a_i$ as
an $(n + 1)$th component to $x_i$, $i = 1, \ldots, n$. Let $y$ in $L$ be different from
the $x_i$'s. Then since $d(x_i, x_j) = n - 1$ for all $i \ne j$, each $a_s$ occurs exactly
once as a $t$th component of the $x_i$'s for $t = 2, \ldots, n$. Hence $d(x_i, y) > n - 2$
for $i = 1, \ldots, n$ implies that there is a unique subscript $m$ for which
$d(x_m, y) = n$. Let $y'$ in $S^{n+1}(n)$ be the element obtained by adjoining $a_m$ as
an $(n + 1)$th component to $y$. Now repeat the above process for all $y$ in $L$
different from the $x_i$'s, and denote by $L'$ the set of $n^2$ elements of $S^{n+1}(n)$ which
are thus obtained. By construction it is clear that $x_i'$ has distance $n$ from
each of the other elements of $L'$. Let $y_1'$ and $y_2'$ be any two distinct elements
of $L'$ different from the $x_i'$. If $d(y_1, y_2) = n$, then $d(y_1', y_2') \ge n$. If $y_1$ and
$y_2$ have the same first component $b \ne a_1$, let $y_3, \ldots, y_n$ denote the remaining
elements of $L$ with first component $b$. Then applying the argument used
above to the $y_i$, for each $x_j$ there is a unique subscript $m$ for which $d(y_m, x_j) = n$.
Hence, by construction, no two of the $y_i'$ have the same $(n + 1)$th component,
and so in particular, $d(y_1', y_2') = n$. Finally, if $y_1$ and $y_2$ have the
same $j$th component $b$, $2 \le j \le n$, let $y_3, \ldots, y_{n-1}, x_i$ denote the remaining
elements of $L$ with $j$th component $b$. Again applying the above argument,
we obtain $d(y_1', y_2') = n$. Hence $d(x', y') \ge n$ for all distinct $x'$, $y'$ in $L'$, and so $L'$
is an $L(n, n + 1, 2)$ space and $S^{n+1}(n)$ is 2-orthogonal.
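
The extension step in this proof is constructive and can be carried out mechanically. The following sketch follows the construction, assuming the symbols $a_1, \ldots, a_n$ are represented by the integers $0, \ldots, n - 1$ and starting from a hypothetical $L(p, p, 2)$ space built linearly over $GF(p)$, $p$ prime. The function names and the starting space are illustrative choices, not taken from the paper.

```python
from itertools import product


def hamming(u, v):
    """Number of components in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))


def min_distance(L):
    """Smallest distance between two distinct elements of L."""
    return min(hamming(u, v) for i, u in enumerate(L) for v in L[i + 1:])


def initial_space(p):
    """A hypothetical L(p, p, 2) space: vectors (a, b, a+b, a+2b, ..., a+(p-2)b) mod p."""
    return [(a, b) + tuple((a + c * b) % p for c in range(1, p - 1))
            for a, b in product(range(p), repeat=2)]


def extend(L, n):
    """Adjoin an (n+1)th component as in the proof of Theorem 4.7."""
    first = L[0][0]
    xs = [v for v in L if v[0] == first]        # the n elements x_1, ..., x_n
    index = {x: i for i, x in enumerate(xs)}
    out = []
    for y in L:
        if y in index:
            out.append(y + (index[y],))         # x_i receives the symbol a_i
        else:
            far = [i for i, x in enumerate(xs) if hamming(x, y) == n]
            assert len(far) == 1                # uniqueness of m, as argued in the proof
            out.append(y + (far[0],))           # y receives the symbol a_m
    return out


L = initial_space(5)
L_ext = extend(L, 5)
print(min_distance(L), min_distance(L_ext))     # n - 1 before the extension, n after
```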


The above theorem is equivalent to saying that every set of $n - 2$ orthogonal
Latin squares of side $n$ may be completed to a full set of $n - 1$ orthogonal
Latin squares. From our remarks following Theorem 4.6, it is interesting
to note that the corresponding theorem for cubes is false.

From Lemma 3, Theorem 4.7, the Bruck-Ryser non-existence theorem (10) 
and the relations among orthogonal Latin squares, projective planes, and 
L(n, k, r) spaces, we obtain immediately: 


THEOREM 4.8. If $n \equiv 1$ or $2 \pmod 4$ and the square-free part of $n$ is divisible
by a prime of the form $4k + 3$, then $S^k(n)$ is not $r$-orthogonal for $k \ge n + r - 2$,
$r \ge 2$.


THEOREM 4.9. If $S^k(n)$ is $r$-orthogonal, then it admits of a partitioning into
pairwise disjoint, superposable $L(n, k, r)$ spaces.


Proof. Suppose $S^k(n)$ is $r$-orthogonal, and let $L$ denote an $L(n, k, r)$ space
of $S^k(n)$. Let $A_1, \ldots, A_{k-r}$ be $k - r$ Latin squares of side $n$, and denote by
$\alpha(i, j)$ the permutation $t \to a_t$, $t = 1, \ldots, n$, where $(a_1, a_2, \ldots, a_n)$ is the $i$th
row vector of $A_j$. Finally, let $w = w(i_1, \ldots, i_{k-r})$ denote the mapping of
$S^k(n)$ into itself generated by performing the permutation $\alpha(i_j, j)$ upon the
$j$th components of the elements of $S^k(n)$, $j = 1, \ldots, k - r$. It is clear that
$w$ is indeed a motion (Definition 4), and so under $w$, $L$ is carried into a
superposable $L(n, k, r)$ space, $L(i_1, \ldots, i_{k-r})$. Further, from Lemma 2 with $t = r$,
it is easy to see that if $(i_1, \ldots, i_{k-r})$ and $(j_1, \ldots, j_{k-r})$ are distinct as vectors,
then $L(i_1, \ldots, i_{k-r})$ and $L(j_1, \ldots, j_{k-r})$ are disjoint. Finally, since
$L(i_1, \ldots, i_{k-r})$ consists of $n^r$ elements, and there are $n^{k-r}$ distinct vectors
$(i_1, \ldots, i_{k-r})$, it is clear that these spaces exhaust $S^k(n)$.


THEOREM 4.10. For $r > 1$, $L(n, k, r)$ is a metric basis for $S^k(n)$.


Proof. The theorem is trivial for $k = r$, so assume $k > r$. Let $x$ and $y$ be
arbitrary in $S^k(n)$, $x = (a_1, \ldots, a_k)$, $y = (b_1, \ldots, b_k)$, $x \ne y$. The theorem
will be proved if it can be shown that (C): there exists $z$ in $L(n, k, r)$ such
that $d(x, z) < d(y, z)$. Proof is by induction on $r$. Consider first $L(n, k, 2)$.
Suppose, say, $a_1 \ne b_1$. Let $z_1, \ldots, z_n$ be the $n$ elements of $L(n, k, 2)$ with first
component equal to $a_1$. If for some $z_i$, $d(y, z_i) = k$, we are done since
$d(x, z_i) \le k - 1$. So suppose $d(y, z_i) < k$ for $i = 1, \ldots, n$. Then since $L(n, k, 2)$
has minor diameter $\ge k - 1$, by Lemma 5 we must have $k = n + 1$, and
hence $d(y, z_i) = k - 1$, $i = 1, \ldots, n$. Now let $z_t$ be the unique element
among the $z_i$ (Lemma 2) with second component equal to $a_2$. Then
$d(y, z_t) = k - 1$ and $d(x, z_t) \le k - 2$. This completes the proof for $r = 2$.
Now suppose that (C) holds for $r - 1$ ($r \ge 3$) and all $k$, and consider $L(n, k, r)$.
If $d(x, y) < r$, then by Lemma 2, we can always find a $z$ in $L(n, k, r)$ such
that $d(x, z) < d(y, z)$. So suppose $d(x, y) \ge r \ge 3$. Select the $n^{r-1}$ elements
in $L(n, k, r)$ with $k$th component equal to $a_k$. Denote these elements by
$z_1, \ldots, z_t$, $t = n^{r-1}$, and let $z_1', \ldots, z_t'$ be the corresponding elements of
$S^{k-1}(n)$ obtained by dropping the $k$th components of the $z_i$. By the proof of
Lemma 3, the $z_i'$ constitute an $L(n, k - 1, r - 1)$ space, and by the induction
hypothesis there exists $z_i'$ in $L(n, k - 1, r - 1)$ with $d(x', z_i') < d(y', z_i')$,
where $x' = (a_1, \ldots, a_{k-1})$, $y' = (b_1, \ldots, b_{k-1})$. But $d(x', z_i') = d(x, z_i)$, and
$d(y', z_i') \le d(y, z_i)$. Hence $d(x, z_i) < d(y, z_i)$.
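
For small parameters the metric-basis property can be confirmed by exhaustive search: a set is a metric basis if distinct points of $S^k(n)$ never have the same list of distances to its elements. The sketch below checks this for a hypothetical linear $L(3, 4, 2)$ space over $GF(3)$, and shows by contrast that an $L(2, 2, 1)$ space fails, in keeping with the restriction $r > 1$. The function names and example spaces are illustrative assumptions.

```python
from itertools import product


def hamming(u, v):
    """Number of components in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))


def is_metric_basis(basis, n, k):
    """True if distinct points of S^k(n) have distinct distance profiles to `basis`."""
    seen = set()
    for x in product(range(n), repeat=k):
        profile = tuple(hamming(x, z) for z in basis)
        if profile in seen:
            return False
        seen.add(profile)
    return True


# Hypothetical L(3, 4, 2) space: vectors (s, t, s + t, s + 2t) over GF(3).
rows = [(1, 0), (0, 1), (1, 1), (1, 2)]
L = [tuple((a * s + b * t) % 3 for a, b in rows)
     for s, t in product(range(3), repeat=2)]

print(is_metric_basis(L, 3, 4))                 # the r = 2 space separates all 81 points
print(is_metric_basis([(0, 0), (1, 1)], 2, 2))  # an L(2, 2, 1) space does not
```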


5. Concluding remarks. The investigation of the metric properties of
$S^k(n)$ and, in general, of power-sets has, of course, only its beginnings in the
present paper. One of the initial problems is the discovery of further significant
concepts (such as "extent" appears to be, for example), since many of
the classical metric concepts apparently will have limited value, and topological
concepts become completely trivial for the finite spaces. High on the
list of desiderata would be a development of the basic theory to the point
where the elements, say, of $S^k(n)$ could be treated abstractly, making it
unnecessary to deal with their internal structure each time a new result is
under scrutiny. For it is precisely at the point where internal combinatorial
structure becomes too complex for the mind to grasp as a totality that our
efforts fail.

A line of attack which has been neglected in the present paper and which
may prove to be fruitful, is an examination of the distance matrix. One may
readily obtain an indication of the manner in which some of the properties
of $S^k(n)$ are reflected in its distance matrix A by going through the definitions
and theorems and rephrasing them in terms of A. Of course, one of the critical
questions in this regard is whether these and other significant properties of A
lend themselves to matric methods and theory. Also of value may be an
investigation of the behaviour of subspaces of $S^k(n)$ under motions of $S^k(n)$.
(Any circle of radius 1, $C = C(a, 1) = \{x;\ d(x, a) = 1\}$, is a metric basis
for $S^k(n)$. Considering such a circle, it is not difficult to show that the group
of motions of $S^k(n)$ is the semi-direct product of A by B, where A is the direct
product of $k$ symmetric groups on $n$ letters, and B is the symmetric group on $k$
letters.) For example, what can be said about the group of motions which
carries an $L(n, k, r)$ space into itself?

In addition to these metric spaces being objects of interest in their own 
right, the results thus far obtained offer hope that this type of approach may 
provide a useful common orientation for a wide class of combinatorial prob- 
lems. 


Acknowledgments. The author wishes to express his sincere appreciation to 
his former adviser Professor D. R. Whitney for many helpful suggestions 
in the exposition, to Professors Marshall Hall, Jr., and H. B. Mann for taking 
time from their own busy schedules to lend a sympathetic ear, and most 
especially to Professors Whitney and H. J. Ryser for their continued en- 
couragement. Finally, the author wishes to thank the referee for calling his 
attention to recent developments. 


Addendum. Since the submission of this paper, there have been several
developments in the field. The work of Bose, Parker, and Shrikhande has
annihilated the Euler conjecture. It is now known that pairs of orthogonal
Latin squares ($L(n, 4, 2)$ spaces) exist for all orders except $n = 2, 6$. It will
be interesting to see to what extent their construction techniques can be
extended to general $L(n, k, r)$ spaces. Remaining related problems are in a
state of flux. Also, in a recent conversation the author learned from Professor
Bose that the metric space $S^k(2)$ has been studied in connection with error
correcting codes. (The metric space $S^k(2)$ is, of course, isometric with the
set of vertices of a $k$-dimensional Euclidean hypercube of unit side, where the
distance between vertices is taken as the square of the Euclidean distance. Also,
$S^k(n)$ can be essentially embedded in $S^{kn}(2)$ in a trivial manner.) The elements
are termed $k$-place messages and the metric is termed the Hamming distance.
It is important in the theory of symmetric binary codes to determine
the $t$-extent of $S^k(2)$. Discussion of this problem and additional bibliography
can be found in (28), along with an excellent summary of the status of the
existence problem for Hadamard matrices. One of the main results in the
above paper is that if we consider $(4t - 1)$-place messages having all mutual
distances greater than or equal to $2t$, then the existence of the maximum
number, $4t$, of such messages is equivalent to the existence of a symmetric
BIBD with parameters $v = b = 4t - 1$, $r = k = 2t - 1$, $\lambda = t - 1$ (or
equivalently, to the existence of a Hadamard matrix of order $4t$). This result
may also be obtained as a corollary to Theorem 4.2 of the present paper
by taking $n = 2$, $k = 4t - 1$ and $r = 2t$. The design derived from the resolvable
BIBD of the theorem by deleting one variety and all blocks not containing
it, is precisely the symmetric BIBD obtained by Bose and Shrikhande.
More generally, for $n = 2$, Theorem 4.2 may be rephrased: If we consider
$m$-place messages having all mutual distances greater than or equal to $d$, then
for $d + 1 \le m < 2d$, the maximum number of such messages is less than or
equal to $2d/(2d - m)$, and equality is attained if and only if there exists a
BIBD with parameters

$$b = m, \quad v = \frac{m}{2d - m}, \quad r = m - d, \quad k = \frac{m - d}{2d - m}, \quad \lambda = \frac{2m - 3d}{2}.$$
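
The bound $2d/(2d - m)$ and the associated BIBD parameters can be evaluated numerically. The sketch below does this with exact fractions and, as an illustration of the case $t = 2$ ($m = 7$, $d = 4$), checks that a set of 8 binary 7-place messages with all mutual distances equal to 4 attains the bound. The function names are hypothetical, the range $d + 1 \le m < 2d$ is taken as read from the statement above, and the 8 messages are generated by a standard inner-product construction chosen for this example rather than taken from the paper.

```python
from fractions import Fraction
from itertools import product


def max_messages_bound(m, d):
    """Upper bound 2d / (2d - m) on the number of m-place binary messages."""
    if not d + 1 <= m < 2 * d:
        raise ValueError("bound is stated only for d + 1 <= m < 2d")
    return Fraction(2 * d, 2 * d - m)


def bibd_parameters(m, d):
    """Parameters (b, v, r, k, lambda) of the BIBD associated with equality."""
    denom = 2 * d - m
    return {"b": m, "v": Fraction(m, denom), "r": m - d,
            "k": Fraction(m - d, denom), "lambda": Fraction(2 * m - 3 * d, 2)}


def hamming(u, v):
    """Number of components in which u and v differ."""
    return sum(a != b for a, b in zip(u, v))


# Eight 7-place messages: inner products of each 3-bit vector with the 7 non-zero ones.
points = [h for h in product((0, 1), repeat=3) if any(h)]
code = [tuple(sum(a * b for a, b in zip(msg, h)) % 2 for h in points)
        for msg in product((0, 1), repeat=3)]

distances = {hamming(u, v) for i, u in enumerate(code) for v in code[i + 1:]}
print(max_messages_bound(7, 4), len(code), distances)
print(bibd_parameters(7, 4))
```

For $m = 4t - 1$ and $d = 2t$ the parameters reduce to $v = b = 4t - 1$, $r = k = 2t - 1$, $\lambda = t - 1$, in agreement with the symmetric design mentioned earlier in the Addendum.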


REFERENCES 


1. Reinhold Baer, Nets and groups, Trans. Amer. Math. Soc., 46 (1939), 110-141.
2. W. W. Rouse Ball, Mathematical Recreations and Essays (New York, 1947).
3. Grace E. Bates, Free loops and nets and their generalizations, Amer. J. Math., 69 (1947),
499-550.
4. Leonard M. Blumenthal, Distance geometries, University of Missouri Studies, vol. 13,
no. 2 (1938).
5. ———— Theory and applications of distance geometry (New York, 1953).
6. R. C. Bose, On the application of the properties of Galois fields to the problem of construction
of hyper-Graeco-Latin squares, Sankhyā, 3 (1938), 323-338.
7. R. C. Bose and K. A. Bush, Orthogonal arrays of strength two and three, Ann. Math. Stat., 23
(1952), 508-524.
8. K. A. Bush, Orthogonal arrays of index unity, Ann. Math. Stat., 23 (1952), 426-434.
9. R. H. Bruck, Finite nets, I. Numerical invariants, Can. J. Math., 3 (1951), 94-107.
10. R. H. Bruck and H. J. Ryser, The nonexistence of certain finite projective planes, Can. J.
Math., 1 (1949), 88-93.
11. S. Chowla and H. J. Ryser, Combinatorial problems, Can. J. Math., 2 (1950), 93-99.
12. Leonard Euler, Recherches sur une nouvelle espèce de quarrés magiques, Commentationes
Algebraicae, Opera Omnia, series prima, 7 (1923).
12.1 R. A. Fisher and F. Yates, Statistical tables for biological, agricultural, and medical research
(1st ed., Oliver and Boyd, 1938).
13. Marshall Hall, Jr., Projective planes, Trans. Amer. Math. Soc., 54 (1943), 229-277.
14. F. W. Levi, Finite geometrical systems (University of Calcutta, 1942).
15. A. Lindenbaum, Contributions à l'étude de l'espace métrique, I, Fund. Math., 8 (1926),
209-222.
16. H. F. MacNeish, Euler squares, Ann. Math., 23 (1921), 221-227.
17. H. B. Mann, On orthogonal Latin squares, Bull. Amer. Math. Soc., 50 (1944), 249-257.
18. ———— Analysis and design of experiments (New York, 1949).
19. H. W. Norton, The 7 × 7 squares, Ann. Eugen., 9 (1939), 269-307.
20. R. E. A. C. Paley, On orthogonal matrices, J. Math. and Phys., 12 (1933), 311-320.
21. C. R. Rao, Hypercubes of strength d leading to confounded designs in factorial experiments,
Bull. Calcutta Math. Soc., 38 (1946), 67-78.
22. H. J. Ryser, Geometries and incidence matrices, Amer. Math. Monthly, 62 (1955), 25-31.
23. Esther Seiden, On the problem of construction of orthogonal arrays, Ann. Math. Stat., 25
(1954), 151-156.
24. W. L. Stevens, The completely orthogonalized Latin square, Ann. Eugen., 9 (1939), 82-93.
25. G. Tarry, Le problème de 36 officiers, Mathesis, 20 (1901).
26. F. Yates, Incomplete randomized blocks, Ann. Eugen., 7 (1936), 121-140.


Added in proof 


27. R. C. Bose and S. S. Shrikhande, On the falsity of Euler's conjecture about the non-existence
of two orthogonal Latin squares of order 4t + 2, Proc. N. A. S., 45 (1959), 734-737.
28. ———— A note on a result in the theory of code construction, Inform. and Control, 2 (1959),
183-194.
29. ———— On the falsity of Euler's conjecture for all orders exceeding 26, Amer. Math. Soc.
Notices, 6 (1959), 379.
30. R. W. Hamming, Error detecting and error correcting codes, Bell System Tech. J., 29 (1950),
147-160.
31. E. T. Parker, Construction of some sets of pairwise orthogonal Latin squares, Amer. Math.
Soc. Notices, 5 (1958), 815.
32. ———— Orthogonal Latin squares, Proc. Nat. Acad. Sci., 44 (1959), 859-862.
33. ———— Completion of disproof of Euler's conjecture, Amer. Math. Soc. Notices, 6 (1959), 391.
34. M. Plotkin, Binary codes with specified minimum distance, Research Div. Rept. 51-20,
University of Pennsylvania.
35. R. R. Varsamov, The evaluation of signals in codes with correction of errors, Math. Rev., 20
(1959), 262.


Ohio State University 
and 
The National Bureau of Standards 










